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Abstract 

We study the problem of probabilistic deduction with conditional constraints over basic 
events. We show that globally complete probabilistic deduction with conditional constraints 
over basic events is NP-hard. We then concentrate on the special case of probabilistic 
deduction in conditional constraint trees. We elaborate very efficient techniques for globally 
complete probabilistic deduction. In detail, for conditional constraint trees with point 
probabilities, we present a local approach to globally complete probabilistic deduction, 
which runs in linear time in the size of the conditional constraint trees. For conditional 
constraint trees with interval probabilities, we show that globally complete probabilistic 
deduction can be done in a global approach by solving nonlinear programs. We show how 
these nonlinear programs can be transformed into equivalent linear programs, which are 
solvable in polynomial time in the size of the conditional constraint trees. 

1. Introduction 

Dealing with uncertain knowledge plays an important role in knowledge representation and 
reasoning. There are many different formalisms and methodologies for handling uncertainty. 
Most of them are directly or indirectly based on probability theory. 

In this paper, we focus on probabilistic deduction with conditional constraints over basic 
events (that is, interval restrictions for conditional probabilities of elementary events). The 
considered probabilistic deduction problems consist of a probabilistic knowledge base and 
a probabilistic query. We give a classical example. As a probabilistic knowledge base, we 
may take the probabilistic knowledge that all ostriches are birds, that the probability of 
Tweety being a bird is greater than 0.90, and that the probability of Tweety being an ostrich 
provided she is a bird is greater than 0.80. As a probabilistic query, we may now wonder 
about the entailed greatest lower and least upper bound for the probability that Tweety 
is an ostrich. The solution to this probabilistic deduction problem is 0.72 for the entailed 
greatest lower bound and 1.00 for the entailed least upper bound. 

More generally, probabilistic deduction with conditional constraints over propositional 
events can be done in a global approach by linear programming or in a local approach by 
the iterative application of inference rules. Note that it is immediately NP-hard, since it 
generalizes the satisfiability problem for classical propositional logic (see Section 2.2). 

Research on the global approach spread in particular after the important work on prob- 
abilistic logic by Nilsson (1986) (see also the work by Paafi, 1988). The main focus was 
on analyzing the computational complexity of satisfiability and entailment in probabilis- 
tic logic and on developing efficient linear programming algorithms for these problems. 
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Georgakopoulos et al. (1988) show that the satisfiabihty problem in probabilistic logic is 
NP-complete and propose to apply column generation techniques for its processing. This 
approach was further developed by Kavvadias and Papadimitriou (1990), Jaumard et al. 
(1991), Andersen and Hooker (1994), and Hansen et al. (1995). In particular, Jaumard et 
al. (1991) report promising experimental results on the efficiency in special cases of prob- 
abilistic satisfiability and entailment. Moreover, Kavvadias and Papadimitriou (1990) and 
Jaumard et al. (1991) identify special cases of probabilistic satisfiability that can be solved 
in polynomial time. Other work on the global approach concentrates on reducing the num- 
ber of linear constraints (Luo et al. 1996) and the number of variables (Lukasiewicz, 1997). 
Finally, Fagin et al. (1992) present a sound and complete axiom system for reasoning about 
probabilities that are expressed by linear inequalities over propositional events. They show 
that the satisfiability problem in this quite expressive framework is still NP-complete. 

In early work, Dubois and Prade (1988) use inference rules to model default reason- 
ing with imprecise numerical and fuzzy quantifiers. For this reason, subsequent research 
on inference rules especially aims at analyzing patterns of human commonsense reasoning 
(Dubois et al. 1990, 1993; Amarger et al. 1991; Thone, 1994; Thone et al. 1995). Frisch 
and Haddawy (1994) discuss the use of inference rules for deduction in probabilistic logic. 
Recent work on inference rules concentrates on integrating probabilistic knowledge into de- 
scription logics (Heinsohn, 1994) and on analyzing the interplay between taxonomic and 
probabilistic deduction (Lukasiewicz 1998a, 1999a). 

We now summarize the main characteristics of the global and the local approach. 

The global approach can be performed within quite rich probabilistic languages (Fagin 
et al., 1992). Crucially, probabilistic deduction by linear programming is globally complete 
(that is, it really provides the requested tightest bounds entailed by the whole probabilistic 
knowledge base). However, a main drawback of the global approach is that it generally does 
not provide useful explanatory information on the deduction process. Finally, results on 
the special-case tractability of global approaches are driven by the technical possibilities of 
linear programming techniques and not by the needs of artificial intelligence applications. 
Hence, they do not seem to be very useful in the artificial intelligence context. 

A main advantage of the local approach is that the deduced results can be explained 
in a natural way by the sequence of applied inference rules (Amarger et al. 1991; Frisch 
& Haddawy, 1994). However, the iterative application of inference rules is generally re- 
stricted to quite narrow probabilistic languages. Moreover, it is very rarely and only within 
very restricted languages globally complete (Frisch and Haddawy, 1994, give an example 
of globally complete local probabilistic deduction in a very restricted framework). Finally, 
as far as the computational complexity is concerned, there are very few experimental and 
theoretical results on the special-case tractability of local approaches. 

The main motivating idea of this paper is to elaborate efficient local techniques for 
globally complete probabilistic deduction. Inspired by previous work on inference rules, we 
focus our research on the language of conditional constraints over basic events: 

Dubois and Prade (1988) study the chaining of two bidirectional conditional constraints 
over basic events ("quantified syllogism rule") and some of its special cases. Dubois et 
al. (1990) additionally discuss probabilistic deductions about conjunctions of basic events. 
Furthermore, they describe the open problem of probabilistic deduction along a chain of 
more than two bidirectional conditional constraints over basic events. In later work, Dubois 
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et al. (1993) use a qualitative version of the "quantified syllogism rule" in an approach to 
reasoning with linguistic quantifiers. Amarger et al. (1991) propose to apply the "quan- 
tified syllogism rule" and the "generalized Bayes' rule" to sets of bidirectional conditional 
constraints over basic events. They report promising experimental results on the global com- 
pleteness and the computational complexity of the presented deduction technique. However, 
this deduction technique is generally not globally complete. Thone (1994) examines trees of 
bidirectional conditional constraints over basic events. He presents a linear-time deduction 
technique that is based on a system of inference rules and that computes certain logically 
entailed greatest lower bounds (in the technical notions of this paper, which will be defined 
below, tight lower answers to conclusion-restricted queries are computed). 

As a first contribution of this paper, we show that globally complete probabilistic de- 
duction with conditional constraints over basic events is NP-hard. It is surprising that this 
quite restricted class of probabilistic deduction problems is still computationally so diffi- 
cult. Hence, it is unlikely that there is an algorithm that efficiently solves all probabilistic 
deduction problems with conditional constraints over basic events. However, we can still 
hope that there are efficient special-case, average-case, or approximation algorithms. 

In this paper, we then elaborate efficient special-case algorithms. In detail, we concen- 
trate on probabilistic deduction in conditional constraint trees. It is an interesting sub- 
class of all probabilistic deduction problems with conditional constraints over basic events. 
Conditional constraint trees are undirected trees with basic events as nodes and with bidi- 
rectional conditional constraints over basic events as edges between the nodes (that is, 
deduction in conditional constraint trees is a generalization of deduction along a chain of 
bidirectional conditional constraints over basic events). Like Bayesian networks, conditional 
constraint trees represent a well-structured probabilistic knowledge base. Differently from 
Bayesian networks, they do not encode any probabilistic independencies. 

As a main contribution of this paper, we have the following results. For conditional con- 
straint trees with point probabilities, we present functions for deducing greatest lower and 
least upper bounds in linear time in the size of the conditional constraint trees. Moreover, for 
conditional constraint trees with interval probabilities, we show that greatest lower bounds 
can be deduced in the same way, in linear time in the size of the conditional constraint trees. 
However, computing least upper bounds turns out to be computationally more difficult. It 
can be done by solving special nonlinear programs. We show how these nonlinear programs 
can be transformed into equivalent linear programs. The resulting linear programs have a 
number of variables and inequalities linear and polynomial, respectively, in the size of the 
conditional constraint trees. Thus, our way of deducing least upper bounds still runs in 
polynomial time in the size of the conditional constraint trees, since linear programming 
runs in polynomial time in the size of the linear programs. 

Another important contribution of this paper is related to the question whether to 
perform probabilistic deduction with conditional constraints by the iterative application of 
inference rules or by linear programming. On the one hand, the idea of inference rules carries 
us to very efficient techniques for globally complete probabilistic deduction in conditional 
constraint trees. In particular, the considered deduction problems generalize patterns of 
commonsense reasoning. However, on the other hand, the corresponding proofs of soundness 
and global completeness are technically quite complex. Hence, it seems unlikely that the 
results of this work can be extended to significantly more general probabilistic deduction 
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problems. Note that a companion paper (1998a, 1999a) reports similar limits of the local 
approach in probabilistic deduction under taxonomic knowledge. 

The rest of this paper is organized as follows. In Section 2, we formulate the proba- 
bilistic deduction problems considered in this work. Section 3 focuses on the probabilistic 
satisfiability of conditional constraint trees. Section 4 deals with globally complete proba- 
bilistic deduction in exact and general conditional constraint trees. In Section 5, we give a 
comparison with Bayesian networks. Section 6 summarizes the main results of this work. 

2. Formulating the Probabilistic Deduction Problem 

In this section, we introduce the syntactic and semantic notions related to probabilistic 
knowledge in general and to conditional constraint trees in particular. 

2.1 Probabilistic Knowledge 

Before focusing on the details of conditional constraint trees, we give a general introduction 
to the kind of probabilistic knowledge considered in this work. We deal with conditional 
constraints over propositional events. They represent interval restrictions for conditional 
probabilities of propositional events. Note that the formal background introduced in this 
section is commonly accepted in the literature (see especially the work by Prisch and Had- 
dawy, 1994, for other work in the same spirit). 

We assume a nonempty and finite set of basic events B = {Bi,B2, . . . , B^}. The set of 
conjunctive events Cg is the closure of B under the Boolean operation A. We abbreviate the 
conjunctive event C A-D by CD. The set oi propositional events Qb is the closure of B under 
the Boolean operations A and -■. We abbreviate the propositional events G A H and -iG 
by GH and G, respectively. The false event Bi A -iSi and the true event -'{Bi A -'Bi) are 
abbreviated by _L and T, respectively. Conditional constraints are expressions of the form 
{H\G)[ui,U2] with real numbers ui,U2 G [0,1] and propositional events G and H. In the 
conditional constraint {H\G)[ui,U2], we call G the premise and H the conclusion. 

To define probabilistic interpretations of propositional events and of conditional con- 
straints, we introduce atomic events and the binary relation ^ between atomic and propo- 
sitional events. The set of atomic events Ab is defined by Ab = {EjEs ■■■ En\Ei = Bi 
or E-i = B-i for all i E [1 Note that each atomic event can be interpreted as a possible 
world (which corresponds to a mapping from B to {true, false}). For all atomic events A 
and all propositional events G, let ^4 ^ G iff AG is a propositional contradiction. 

A probabilistic interpretation Pr is a mapping from Ab to [0, 1] such that all Pr{A) with 
A G Ab sum up to 1. Pr is extended in a well-defined way to propositional events G by: 
Pr{G) is the sum of all Pr{A) with A G Ab and A ^ G. Pr is extended to conditional 
constraints by: Pr |= {H\G)[ui,U2] iflF iti ■ Pr{G) < Pr{GH) < U2 ■ Pr{G). 

Note that conditional constraints characterize conditional probabilities of events, rather 
than probabilities of conditional events (Coletti, 1994; Gilio & Scozzafava, 1994). Note also 
that Pr{G) = always entails Pr |= {H\G)[ui,U2]. This semantics of conditional probability 
statements is also assumed by Halpern (1990) and by Prisch and Haddawy (1994). 

The notions of models, satisfiability, and logical consequence for conditional constraints 
are defined in the classical way. A probabilistic interpretation Pr is a model of a conditional 
constraint (if|G)[iti, 1*2] iff Pr \= {H\G)[ui,U2]. Pr is a model of a set of conditional 
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constraints KB, denoted Pr |= KB, iff Pr is a model of all {H\G)[ui,U2] G KB. KB is 
satisfiable iff a model of KB exists. {H\G)[ui,U2] is a logical consequence of KB, denoted 
KB 1= {H\G)[ui,U2], iff each model of KB is also a model of {H\G)[ui,U2]- 

For a conditional constraint {H\G)[ui,U2] and a set of conditional constraints KB, let 
It denote the set of all real numbers u G [0, 1] for which there exists a model Pr of KB with 
u ■ Pr{G) = Pr{GH) and Pr{G) > 0. Now, we easily verify that {H\G)[ui,U2] is a logical 
consequence of KB iff ui < infix and U2 > sup it. 

This observation yields a canonical notion of tightness for logical consequences of con- 
ditional constraints. The conditional constraint {H\G)[ui,U2] is a tight logical consequence 
of KB, denoted KB \=t^ght {H\G)[ui,U2\, iff ui = inf m and U2 = sup it. 

The set u is a closed interval in the real line (Frisch & Haddawy, 1994). Note that for 
tt = 0, we canonically define inf tt = max [0, 1] = 1 and sup it = min [0, 1] = 0. Thus, tt = 
mKB ^ (G|T)[0,0] mKB ^Hg^t {H\G)[l,Q] ^ KB ^ {H\G)[ui,U2\ for all ui,U2 G [0,1]. 

Based on the just introduced notion of tight logical consequence, probabilistic deduction 
problems and their solutions are more formally specified as follows. 

A probabilistic knowledge base {B,KB) consists of a set of basic events B and a set of 
conditional constraints KB over with ui < U2 for all {H\G)[ui,U2\ G KB. A probabilistic 
query to a probabilistic knowledge base {B, KB) is an expression of the form 3{F\E)[xi,X2\ 
with E,F E Qb and two different variables xi and X2- Its tight answer is the substitution 
a = {xi/ui, X2/U2} with ui,U2 G [0,1] such that KB \=tigfii {F\E)[ui,U2] (we call o"i = 
{xi/ui} the tight lower answer and a2 = {X2/U2} the tight upper answer). A correct answer 
is a substitution a = {xi/ui, X2/U2} with ui,U2 G [0, 1] such that KB \= {F\E)[ui,U2\. 

Finally, we define the notions of soundness and of completeness related to inference 
rules and to techniques for probabilistic deduction. An inference rule KB h {H\G)[ui,U2\ is 
sound iff KB \= {H\G)[ui,U2], where {H\G)[ui,U2\ is a conditional constraint and KB is a 
set of conditional constraints. It is sound and locally complete iff KB \=tigfii {H\G)[ui,U2]. 
A technique for probabilistic deduction is sound for a set of probabilistic queries Q iff it com- 
putes a correct answer to any given query from Q. It is sound and globally complete for Q 
iff it computes the tight answer to any given query from Q. 

2.2 Computational Complexity 

In the framework of conditional constraints over propositional events, the optimization prob- 
lem of computing the tight answer to a probabilistic query is immediately NP-hard, since it 
generalizes the satisfiability problem for classical propositional logic (the NP-complete prob- 
lem of deciding whether a propositional formula in conjunctive normal form is satisfiable; 
see especially the survey by Garey and Johnson, 1979). 

Surprisingly, the optimization problem of computing the tight answer to a probabilistic 
query remains NP-hard even if we just consider conditional constraints over basic events: 

Theorem 2.1 The optimization problem of computing the tight answer to a probabilistic 
query over basic events that is directed to a probabilistic knowledge base over basic events 
is NP-hard. 

Proof. The NP-complete decision problem of graph 3-colorability (Garey & Johnson, 1979) 
can be polynomially-reduced to the optimization problem of computing the tight answer 
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to a probabilistic query over basic events that is directed to a probabilistic knowledge base 
over basic events. The proof follows similar lines to the proof of NP-hardness of 2PSAT 
given by Georgakopoulos et al. (1988). 

Let [V, E) be a finite undirected graph. We construct a probabilistic knowledge base 
{B^ KB) as follows. We initialize {B, KB) with ({.B}, 0). For each node u G F, we increase 
B by the new basic events D},, B^,, and Bf,. For each node v E V and for each i E {1, 2, 3}, 
we increase KB by 1] and (i?* |i?)[l/3, 1/3]. For each node u G F and for each 

i,j G {1, 2, 3} with i < j, we increase KB by (-B^|i?*)[0, 0]. For each edge {u, v} G E and for 
each i G {1,2,3}, we increase KB by (5* |S^)[0, 0]. It is easy to see that the probabilistic 
knowledge base {B, KB) can be constructed in polynomial time in the size of {V, E). 

Now, we show that {V,E) is 3-colorable iff {.ti/I, 2:2/1} is the tight answer to the 
probabilistic query 3{B\B)[xi,X2\ to {B,KB), or equivalently, iS KB is satisfiable: 

If {V,E) is 3-colorable, then there exists a mapping ci from V to {1,2,3} with ci{u) / 
ci{v) for all edges {it, v} G E. Thus, if tt is a cyclic permutation of the members in {1, 2, 3} 
and if C2,C3 : V — > {1,2,3} are defined by C2{v) = it{ci{v)) and cs{v) = tt{c2{v)) for all 
nodes v E V, then also C2{u) ^ C2{v) and cs{u) 7^ cs{v) for all edges {u,v} G E. For 
j G {1, 2, 3}, let Aj G Ab such that Aj ^ B and Aj =^ B'^ iff Cj{v) = i for all nodes v eV 
and i G {1,2,3}. UPriAs [0,1] is defined by Pr{A) = 1/3 for all A G {^1,^2,^3} and 
by Pr{A) = for all A^AbX {^i: ^2- A3}, then Pr is a model of KB. 

Conversely, if there is a model Pr of KB, then there is an atomic event A G As with 
Pr{A) > 0. Thus, if c: F {1, 2, 3} is defined by c{v) = i iS A ^ B^ for all nodes « G F, 
then c{u) / c{v) for all edges {it, v} G E. Hence, (F, is 3-colorable. □ 

Hence, it is unlikely that there is an efficient algorithm for computing the tight answer 
to all probabilistic queries over basic events that are directed to any given probabilistic 
knowledge base over basic events. However, there may still be efficient algorithms for 
solving more specialized probabilistic deduction problems. 

The rest of this work deals with probabilistic deduction in conditional constraint trees. 
The next section provides a motivating example, which gives evidence of the practical 
importance of this kind of probabilistic deduction problems. 

2.3 Motivating Example 

A senior student in mathematics describes her experience about being successful at the uni- 
versity as follows. The success of a student (su) is influenced by how well-informed (wi) and 
how well-prepared (wp) the student is. Well-informedness can be reached by interviewing 
professors (pr) or by asking senior students (st). Being well-prepared is influenced by how 
much time is invested in books (bo), exercises (ex), and hobbies (ho). 

It is estimated that the probability of a student being successful given she is well- 
informed lies between 0.60 and 0.70, that the probability of a student being well-informed 
given she is successful is greater than 0.85, that the probability of a student being successful 
given she is well-prepared is greater than 0.95, and that the probability of a student being 
well-prepared given she is successful is greater than 0.95. 

This probabilistic knowledge completed by further probabilistic estimations is given by 
the probabilistic knowledge base {B, KB) in Fig. 1, where B is the set of nodes {su, wi, wp, pr. 
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St, bo, ex, ho} and KB is the least set of conditional constraints that contains (y |X)[iti, it2] 
for each arrow from X to y labeled with ui,U2. 



St ho 




bo 



Figure 1: A Conditional Constraint Tree 

We may wonder whether it is useful for being successful at the university to interview the 
professors, to study on books, to spend the time on one's hobbies, or to do both studying 
on books and spending the time on one's hobbies. This can be expressed by the prob- 
abilistic queries 3(su|pr)[a;i, 2:2], 3(su|bo)[2;i, ^2], 3(su|ho)[a;i, 2:2], and 3(su|bo ho)[2;i, 2:2], 
which yield the tight answers {a;i/0.00, a;2/1.00}, {a;i/0.90, a;2/1.00}, {a;i/0.30, a;2/0.46}, 
and {a;i/0.71, .2;2/1.00}, respectively. 

We may wonder whether successful students at the university interviewed their profes- 
sors, whether they studied on books, whether they spent their time with their hobbies, or 
whether they both studied on books and spent their time with their hobbies. This can be 
expressed by the probabilistic queries 3(pr|su)[a;i, 2:2], 3(bo|su)[a;i, 2:2], 3(ho|su)[a:i, 2:2], and 
3(bo ho|su)[a;i, 2:2], which yield the tight answers {2;i/0.00, 2:2/0.17}, {2:1/0. 90, 2^2/1.00}, 
{2:1/0.30, 2:2/0.45}, and {2:1/0.25,2:2/0.45}, respectively. 

2.4 Conditional Constraint Trees 

We formally define conditional constraint trees and queries to conditional constraint trees. 
We provide some additional examples, which are subsequently used as running examples. 

A (general) conditional constraint tree is a probabilistic knowledge base {B,KB) for 
which an undirected tree (a singly connected undirected graph) (S, -f^) exists such that 
KB contains exactly one pair of conditional constraints (i?|^)[Mi, W2] and 1, ^72] with 

ui^vi > for each pair of adjacent nodes A and B (note that B = {B} implies KB = 0). 
A basic event B E B is called a leaf in {B, KB) iff it has exactly one neighbor in (B, -f-)-). 
A conditional constraint tree is exact iff ui = U2 for all (5|^)[iti, it2] G KB. 

A query to a conditional constraint tree is a probabilistic query 3(F|_E)[2;i, 2:2] with two 
conjunctive events E and F that are disjoint in their basic events and such that all paths 
from a basic event in i? to a basic event in F have at least one basic event in common. 
A query 3(F|£^)[2;i, 2:2] to a conditional constraint tree is premise-restricted iff £^ is a basic 
event. It is conclusion-restricted iff F is a basic event. It is strongly conclusion-restricted 
iff F is the only basic event that is contained in all paths from a basic event in E to F. 
It is complete iff EF contains exactly the leaves of {B, •<->■). 
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Fig. 2 shows two conditional constraint trees of which the one on the left side is exact. 
3(STU|MNQR)[.Ti. .T2] is a query, while 3(MS|QU)[2;i, 2:2] is not a query to the conditional 
constraint trees of Fig. 2. Furthermore, 3(STU|M)[a;i, 2:2] is a premise-restricted query, 
3(0|QRSTU)[a;i, 2:2] a strongly conclusion-restricted query, and 3(QRSTU|M)[rz;i, rz;2] a prem- 
ise-restricted complete query to the conditional constraint trees of Fig. 2. 



1) T 2) T 




R 



Figure 2: Two Conditional Constraint Trees 

For conditional constraint trees {B,KB), conjunctive events C, and basic events B, we 
write C ^ B iS there exists a path Gi, G2, . . . , Gk from a basic event Gi in C to the basic 
event Gk = B such that {Gi+i\Gi)[l, 1] G KB for all i E [1 : k - 1]. We write i? ^ C iff for 
all paths Gi,G2, ■ ■ ■ ,Gk from the basic event Gi = B to a basic event Gk in C, it holds 
{Gi+i\Gi)[l,l]eKB for all i G [1 : A; - 1]. That is, the conditions C^B and B^C 
immediately entail KB |= {B\C)[1, 1] and KB ^ {C\B)[1, 1], respectively. 

Note that the restriction iti, > for all 1/2], (^|-B)[wi, ^2] GiT^isjust made 

for technical convenience. The deduction technique of Section 4 can easily be generalized 
to conditional constraint trees (B, KB) that satisfy only the restriction iti > iff i^i > for 
all {B\A)[ui,U2],{A\B)[vi,V2] G KB (Lukasiewicz, 1996). 

The restriction that for each query 3{F\E)[xi,X2], all paths from a basic event in E 
to a basic event in F have at least one basic event in common is crucial for the deduction 
technique of Section 4. It assures that the problem of computing the tight answer to a 
complete query can be reduced to the problems of computing the tight answer to a premise- 
restricted complete query and the tight answer to a strongly conclusion-restricted complete 
query. Note that this restriction is trivially satisfied by all premise- and conclusion-restricted 
queries (for example, by all the queries in Section 2.3). 

Especially tight answers to conclusion-restricted queries seem to be quite important in 
practice. They may be used to characterize the probability of uncertain basic events given 
a collection of basic events that are known with certainty. 
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3. Probabilistic Satisfiability 

In this section, we show that conditional constraint trees have the nice property that they 
are always satisfiable. That is, within conditional constraint trees, the user is prevented 
from specifying inconsistent probabilistic knowledge. 

First, note that conditional constraint trees always have a trivial model in which the 
probability of the conjunction of all negated basic events is one and in which the probability 
of all the other atomic events is zero. 

The next lemma shows that, given a model Pr of a conditional constraint tree and a 
real number s from [0, 1], we can construct a new model Pr^ by setting Prs{A) = s ■ Pr{A) 
for all atomic events A that are different from the conjunction of all negated basic events. 
Note that Pro coincides with the trivial model and that Pri is identical to Pr. This lemma 
is crucial for inductively constructing models of conditional constraint trees. 

Lemma 3.1 Let {B,KB) be a conditional constraint tree with B = {Bi,B2,. ■ ■ ,Bn}. Let 

Pr be a model of KB and let s be a real number from [0, 1]. 

The mapping Prs : Ab ^ [0, 1] with Prs{A) = s ■ Pr{A) for all AeAsX {^1^2 ■ --Bn} 
and Prs(BiB2 ■ ■ ■ B^) = s ■ Pr(B{B2 ■ ■ ■ Bn) -s + 1 is a model of KB. 

Proof. We easily verify that Prg is a probabilistic interpretation. It remains to show that 
Prs is also a model of KB. Let {H\G)[ui,U2] G KB. Since Pr is a model of KB, we have 
Pr \= {H\G)[ui,U2], hence ui ■ Pr{G) < Pr{GH) < U2 ■ Pr{G), and thus also uis ■ Pr{G) < 
s ■ Pr{GH) < U2S ■ Pr{G). Since neither B1B2 ■ ■ -B^ ^ G noi B1B2 ■ ■ -B^ ^ GH, we get 
ui ■ Prs{G) < Prs{GH) < U2 ■ Prs{G) and thus Pr, \= {H\G)[ui,U2]. □ 

Finally, the following theorem shows that conditional constraint trees always have a 
nontrivial model in which all the basic events have a probability greater than zero. 

Theorem 3.2 Let {B, KB) be a conditional constraint tree with B = {Bi,B2, . . . ,Bn}. 
There is a model Pr of KB with Pr{BiB2 ■ ■ ■ S„) > 0. 

Proof. It is sufficient to show the claim for exact conditional constraint trees. The claim 
is proved by induction on the number of basic events. 

Basis: for {B, KB) = ({S}, 0), a model Pr of KB with Pr{B) > is given by S, 5 H- 0, 1 
(note that S, S 1-^ 0, 1 is an abbreviation for Pr{B) = and Pr{B) = 1). 

Induction: let (B, KB) = {Bi U B2, KBi U KB2) with two exact conditional constraint trees 
{BuKBi) = {{B,C},{{C\B)[uMAB\C)[v,v]}) and {B2,KB2) = {{C, D,, . . . , Dk}, KB2) 
such that Bir\B2 = {C}. A model Pn of KBi with Pri{BC) > is given by: 



BC,BC,BC,BC ^ -jf-, 



v—uv u—uv uv 



By the induction hypothesis, there is a model Pr2 of KB2 (that is defined on the atomic 
events over B2) with Pr2{CDi ■ ■ ■ D^) > 0. By Lemma 3.1, we can assume Pr2{C) = Pri(C). 
A probabilistic interpretation Pr on the atomic events over B is now defined by: 

rr(AbAcA2) - PniAc) 
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for all atomic events Ah, Ac, and A2 over {B}, {C}, and B2 \ {C}, respectively. We easily 
verify that Pr{Ai,Ac) = Pri{Ai,Ac) and Pr{AcA2) = Pr2{AcA2) for all atomic events 
Af), Ac, and A2 over {B}, {C}, and B2 \ {C}, respectively. Hence, Pr is a model of KB. 
Moreover, Pri{BC) > and Pr2{CDi • • • D^) > entails Pr{BCDi ■ ■ ■ Dk) > 0. □ 

4. Probabilistic Deduction 

In this section, we present techniques for computing tight answers to queries directed to 
exact and general conditional constraint trees, and we analyze their computational com- 
plexity. More precisely, the problem of computing the tight answer to a query is reduced to 
the problem of computing the tight answer to a complete query. The latter problem is then 
reduced to the problems of computing the tight answer to a premise-restricted complete 
query and the tight answer to a strongly conclusion-restricted complete query. 

4.1 Premise-Restricted Complete Queries 
4.1.1 Exact Conditional Constraint Trees 

We now focus on the problem of computing tight answers to premise-restricted complete 
queries that are directed to exact conditional constraint trees. 

Let {B, KB) be an exact conditional constraint tree and let 3{F\E)[xi,X2] be a premise- 
restricted complete query. To compute the tight answer to B{F\E)[xi,X2], we start by 
defining a directed tree (that is, a directed acyclic graph in which each node has exactly 
one parent, except for the root, which does not have any): 

A ^ B iff A B and A is closer to E than B. 

This directed tree {B, — >) is uniquely determined by the conditional constraint tree and the 
premise-restricted complete query. Fig. 3 shows (B, — >) for the premise-restricted complete 
query 3(QRSTU|M)[a;i, 2:2] to the exact conditional constraint tree in Fig. 2, left side. 

Now, the set of nodes B is partitioned into several strata. The lowest stratum contains 
only nodes with no children in (B, — )■), the highest stratum contains the nodes with no 
parents in (B, — >) (that is, exactly the node of the premise E of the query). Fig. 3 also 
shows the different strata in our example. 

At each node of {B, -^), we compute certain tightest bounds that are logically entailed 
by KB. More precisely, the tightest bounds at a node B are computed locally, by exploiting 
the tightest bounds that have previously been computed at the children of B. Hence, we 
iteratively compute the tightest bounds at the nodes of each stratum, starting with the 
nodes of the lowest stratum and terminating with the nodes of the highest stratum. We 
distinguish three different ways of computing tightest bounds at a node: 

• initialization of a leaf (Leaf), 

• chaining of an arrow and a subtree via a common node (Chaining), 

• fusion of subtrees via a common node (FUSION). 

Let us consider again the premise-restricted complete query 3(QRSTU|M)[a;i, 2:2] to the 
exact conditional constraint tree in Fig. 2, left side. Fig. 4 illustrates the three different ways 
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R 



4 3 2 1 strata 

Figure 3: Directed Tree (S, 

of computing tightest bounds at a node (the common nodes for Chaining and Fusion are 
filled black). Table 1 shows the greatest lower and the least upper bounds that are computed 
at each node D of each stratum. More precisely, these bounds are ai = inf Pr{BD)/ Pr{B), 
a2 = sup Pr{BD)/Pr{B), j32 = sup Pr(BD)/ Pr{B), and 72 = sup Pr{D)/ Pr{B) subject to 
Pr \= KB and Pr{B) > 0. Table 1 also shows the requested tight answer {xi/0.02, 2:2/0.17}, 
which is given by the tightest bounds ai and a2 that are computed at the premise M. 
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Initialization of a leaf: 



P 




Chaining of an arrow and a subtree: 



. P 




Fusion of subtrees: 



.•■ P 




Figure 4: Local Computations in 
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We now focus on the technical details. We present the functions H", H", H2 , and H2 , 
which compute the described greatest lower and least upper bounds. For this purpose, we 
need the following definitions. Let Pr{C\B) denote u for all (C|i?)[ii, ii] G KB. 

A node 5 is a leaf if it does not have any children. For all leaves B, let B^^ = B. For all 
the other nodes B, let 5^ be the conjunction of all the children of B. For all leaves C, let 
-L(C) = C. For all the other conjunctive events C, let L{C) be the conjunction of all the 
leaves that are in C or that are descendants of a node in C. 

In the sequel, let S be a node and let C = B^. The case C = B refers to the initialization 
of the leaf B, the case C = Bi with a node Bi ^ B to the chaining of the arrow B ^ Bi 
and a subtree via the common node and the case C = B1B2 . . . B^ with A; > 1 nodes 
Bi,B2,. . . ,Bk to the fusion of k subtrees via the common node B. 

We define the function Hf for computing greatest lower bounds: let H{^{B,C) = a\ 
(note that a\ will coincide with the greatest lower bound of Pr{BL{C)) / Pr{B) subject to 
Pr 1= KB and Pr{B) > 0), where «i in Leaf (C = B), Chaining (C = ^i), and Fusion 
{C = B1B2 ■ ■ ■ Bk with > 1) is given as follows: 

Leaf: 

ai = 1 

Chaining: 

ai = max(0,Pr(C|S)-(l + ^^g^)) 
Fusion: 

ai = max(0, 1 - k + H{'{B, Bi)) 

To express that computes greatest lower bounds, we need the following definitions. 
Let B{B, C) comprise 5, all nodes in C and all descendants of a node in C. Let KB{B, C) 
be the set of all conditional constraints of KB over B{B, C). Let Mo{B, C) be the set of all 
models of KB{B, C) that are defined on the atomic events over B{B, C). 

Now, the function is sound and globally complete with respect to B and C iff 
(5, C) = ai is the greatest lower bound oiPr{BL{C)) / Pr{B) subject to Pr G Mo{B, C) 
and Pr{B) > 0. Thus, the next theorem shows soundness and global completeness of H^. 

Theorem 4.1 

a) For all probabilistic interpretations Pr G Mo{B,C), it holds ai ■ Pr{B) < Pr{BL{C)) . 

b) There exists a probabilistic interpretation Pr G Mo{B,C) with Pr[B) > 0, ai • Pr[B) = 
Pr{BL{C)), and Pr(BL{C)) = iffL{C)^B. 

Proof. The proof is given in full detail in Appendix B. □ 

Next, we present the functions H", H2 , and H2 for computing least upper bounds. Note 
that , and H2' show the crucial result that for exact conditional constraint trees, 

there are local probabilistic deduction techniques that are sound and globally complete. 
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In detail, let H^{B,C) = a^, H^{B,C) = and H^{B,C) = 72 (note that 02, P2, 
and 72 will coincide with the least upper bound of Pr{BL{C)) / Pr{B), Pr{BL{C)) / Pr{B), 
and Pr{L{C)) / Pr{B), respectively, subject to Pr |= KB and Pr{B) > 0), where a2, (32, 
and 72 in LEAF (C = B), Chaining (C = Bi), and Fusion {C = B1B2 . . . Bk with A; > 1) 
are given as follows: 

Leaf: 



0:2 
72 



Chaining: 

a2 = 

02 = 

72 = 

Fusion: 

Q!2 = 

/32 = 

72 = 



min(l,Pr(C|B) 



Pr(B|C) ' 



1 - Pr{C\B) ■ (1 



PriB\C) 



Pr(f7|S) ■ (1 + 



Pr(B|C) 



)) 



mm{Pr{C\B) ■ ( 



g.f(C,Ct)+l 
Pr(B|C) 



PriC\B).§^: 



Pr{C\B) ■ 



Pr{B\C) 



mm H^»(B,Bi) 

ie[l:k] 

mm H^(B,Bi) 

ie[i:k] 2V ' «y 

min( min H^{B,Bi), mm {H^{B,Bi) + Hi{B,Bj))) 



i,j£[l:k],ijtj 



The functions , and are sound and globally complete with respect to B 

and C iff H^iB, C) =012, Hl^iB, C) = P2, and H2{B, C) = 72 are the least upper bounds 

of Pr{BL{C)) / Pr{B), Pr(BL{C)) / Pr{B), and Pr{L{C)) / Pr{B), respectively, subject 
to Pr G Mo{B,C) and Pr{B) > 0. Hence, the following theorem shows soundness and 
global completeness of H2, H2 , and HrJ (actually, it shows even more to enable a proof by 
induction on the recursive definition of H2, H2 , and H2). 

Theorem 4.2 

a) For all probabilistic interpretations Pr G Mo{B,C), it holds Pr{BL{C)) < 02 ■ Pr{B), 
Pr(BL{C)) < P2 ■ Pr{B), and Pr{L{C)) < 72 • Pr{B). 

b) There exists a probabilistic interpretation Pr G Mo{B,C) with Pr[B) > 0, Pr{BL[C)) = 
a2 ■ Pr{B), and Pr{L{C)) = 72 • Pr{B). 

c) There exists a probabilistic interpretation Pr G Mo{B,C) with Pr{B) > 0, Pr{BL{C)) = 
P2 ■ Pr{B), and Pr{L{C)) = 72 • Pr{B). 
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Proof. The proof is given in full detail in Appendix B. □ 

Note that Theorem 4.2 also shows that H]{B,Bi) < H^{B,Bi) + H^{B,Bi) for all 
i G [l:k]. Thus, the expression miuj jgji./j] j-^j(772"(-B, Sj) + H2 {B , Bj)) in the definition of 
72 in Fusion can be replaced by a2 + for an increased efficiency in computing 72 by 
exploiting the already computed values of 0:2 and P2 ■ 

Briefly, by Theorems 4.1 and 4.2, the tight answer to the premise-restricted complete 
query 3{F\E)[xuX2] is given by {x^/ H^{E, E^), X2 / {E , E^)} . 

4.1.2 Conditional Constraint Trees 

We now focus on computing the tight answer to premise-restricted complete queries to 
general conditional constraint trees. In the sequel, let {B^KB) be a conditional constraint 
tree and let 3{F\E)[xi,X2] be a premise-restricted complete query. 

We may think that the local deduction technique for exact conditional constraint trees 
of Section 4.1.1 can easily be generalized to conditional constraint trees. In fact, this is true 
as far as the computation of greatest lower bounds is concerned. However, the computation 
of least upper bounds cannot be generalized that easily from exact conditional constraint 
trees to conditional constraint trees. More precisely, generalizing the computation of least 
upper bounds results in solving nonlinear programs. These nonlinear programs and our way 
to solve them are illustrated by the following chaining example. 

Let the conditional constraint tree {B,KB) be given by B = {M,N,0,P} and KB = 
{{N\M)[ui,U2],{Mmin,V2], {0\N)[xi,X2], (N|0)[yi,y2], (P|0)[ri, r2], (0|P)[si, S2]}. Let us 
consider the premise- restricted complete query 3(P|M)[2;i, 2:2]. 

By Theorem 4.2 and some straightforward arithmetic transformations, the requested 
least upper bound is the maximum of z subject to it G [1^1,1/2]: t' G b'i7?^2]) ^ G [2^152:2], 
y E [yi,y2], r £ [?'i5?'2]5 s G [si,S2], and the nonlinear inequalities in (1) to (5): 

(1) z < 1 

(2) z < + + 



uxr 



(3) z < l-u + ^ - ^ + 

^ ' — V vy vys 



uxr 



(4) z < u-^ + ^-^ + 

V / — V vy vy vy, 

(5) z < 



uxr 
vys 



In this system of nonlinear inequalities, all upper bounds of z are monotonically decreasing 
in V, y, and s. Hence, we can equivalently maximize z subject to it G [iti,it2], x G [2:1, 2:2], 
r G [ri,r2], and the nonlinear inequalities in (6) to (10): 

(6) z < 1 

(7) z < 1-U + ^ - ^ + ^ 

(8) z < l-u + ^ - ^ + 

V / — ' VI viyi viyisi 

V / — VI ' viyi viyi ' viyisi 



(10) z < 

^^^> ^ - viyisi 

For example, the requested least upper bound for ui = U2 = u and a;i = 2:2 = a; is shown in 
Fig. 5 for u,x E [0, 1], n = r2 = 0.15, vi = 0.8, yi = 0.8, and si G {0.05, 0.1}. The requested 
least upper bound for ui < U2 or xi < X2 is the maximum value over [1/1,1/2] x [a;i,a;2]. 
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Figure 5: Least Upper Bound Z2 in the Chaining Example 
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We now transform this nonlinear program into an equivalent linear program (by re- 
placing 1, 11, ux, and uxr by the new variables xm, x\^, xq, and xp, respectively). More 
precisely, the maximum of z subject to it G [?ii,ii2], x G [a;i,a;2], r G [ri,r2], and the non- 
linear inequalities in (6) to (10) coincides with the maximum of z subject to the following 
system of linear inequalities over z and xb >^ {B ^ B): 



z < Xyi\ 

z < XM + ^^-xn ^ 

— ivi J, J iM v\y\ 

z < xu - f^-a;N + ^ 

Z < 



XQ 


+ 


SI 

viyisi 


■ Xp 


XQ 


+ 




■ Xp 


viyisi 


XQ 


+ 


1--S1 


■ Xp 


viyisi 






1 


■ xp 






viyisi 



1 < a^M < 1 

Ui ■ XM < Xfi < U2 ■ XM 

■ < a^o < 2:2 ■ Xfi 

ri ■ XQ < Xp < r2 ■ XQ 



More generally, tight upper answers to premise-restricted complete queries to conditional 
constraint trees can be computed by solving similar nonlinear programs, which can similarly 
be transformed into linear programs. 

For example, let us consider the premise-restricted complete query 3(QRSTU|M)[a;i, 2:2] 
to the conditional constraint tree in Fig. 2, right side. The requested least upper bound 
is the maximum of x subject to the system of linear inequalities in Fig. 6 (we actually 
generated 72 linear inequalities of which 31 were trivially subsumed by others). Note that 
the nine variables xm to X[j correspond to the nine nodes M to U. 

X < XM 

X < fiXQ 

X < ^Xr 

X ^ 18 

X ^ 18 "^T 

X < ffa;u 

X < Xt^ + ^Xp 
X < Xf^ + ^Xq 

X < xt^ + IxR 
X < jxo + jgXp 

X < jXQ + ^XQ 

X < Ixo + fxR 

Figure 6: Generated Linear Inequalities in the Chaining Example 

Thus, in this example, the tight upper answer is computed by solving a linear program 
that has 10 variables and 72 linear inequalities. Note that computing the tight upper answer 
by the classical linear programming approach would result in solving a linear program that 
has 2^ = 512 variables and 4 • 9 — 2 = 34 linear inequalities (see Section 4.6). 
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Let us now focus on the technical details. We subsequently generalize the function Hf 
of Section 4.1.1 in a straightforward way to compute greatest lower bounds in conditional 
constraint trees. Moreover, we present a linear program for computing the requested least 
upper bound in conditional constraint trees. 

Let Pri{C\B) denote ui for all {C\B)[ui,U2] G KB. In the sequel, let S be a node and 
let C = B^. Again, the cases C = B, C = Bi with a node Bi 7^ B, and C = B1B2 . . . B^ 
with A; > 1 nodes Bi,B2,. . . ,Bk refer to Leaf, Chaining, and Fusion, respectively. 

We define the generalized function for computing greatest lower bounds in con- 
ditional constraint trees: let Hi{B,C) = ai (note that ai will coincide with the greatest 
lower bound of Pr{BL{C)) / Pr{B) subject to Pr \= KB and Pr{B) > 0), where ai in Leaf 
(C = B), Chaining (C = Bi), and Fusion {C = B1B2 . . .B^ with > 1) is given by: 

Leaf: 

ai = 1 

Chaining: 

ai = max(0,Pri(C|5)-(l + ^^;|^^)) 
Fusion: 

k 

ai = max(0, 1 - k + H^{B, Bi)) 

i=l 

Hi is sound and globally complete with respect to B and C iff H"{B,C) = a\ is the 
greatest lower bound of Pr{BL{C)) / Pr{B) subject to Pr G Mo{B,C) and Pr{B) > 0. 
Thus, the next theorem shows soundness and global completeness of H". 

Theorem 4.3 

a) For all probabilistic interpretations Pr G Mo{B,C), it holds ai ■ Pr{B) < Pr{BL{C)). 

b) There exists a probabilistic interpretation Pr G Mo{B,C) with Pr[B) > 0, ai ■ Pr[B) = 
Pr{BL{C)), and Pr(BL{C)) = iffL{C)^B. 

Proof. The claims follow from Theorem 4.1. □ 

Next, we focus on the requested least upper bound, which is computed by solving a 
linear program as described in the two examples. 

We start by defining the functions I^, and over the variables xb (B E B). Let 
/"(S, C) = a2, /^(S, C) = /32, and r{B, C) = 72, where 0:2, /32, and 72 in Leaf (C = S), 
Chaining (C = Si), and Fusion {C = B^Bs . . . B^ with A; > 1) are given as follows: 

Leaf: 

a2 = Xb 

P2 = 

72 = Xb 
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Chaining: 



mini 



mm{xB 



l-Pri{B\C) 
Pri{B\C) 



' Pri(B\C) 



■ xc + 



11^ jC, Ct) J7(C, Ct) 
Pri(B|C)' Pri{B\C) 




, Xb Xc -\- Pr^(B\C)) 



72 



njc, ct) 

Pri{B\C) 



Fusion: 




ie[l:k] 



i,jell:k],i^j 



min 



{I-{B,Bi)+lP{B,Bj))) 



The system of linear inequalities J(-B, C) is defined as the least set of linear inequalities 
over > (G G B{B, C)) that contains 1 < a;s < 1 and ui ■ xc < xh < U2 ■ xg for all 
{H\G)[ui,U2] G KB{D, C) with G i7 (that is, G is the parent of H). 

The intuition behind these definitions can now be described as follows. 

Each xg (G G B{B, C)) that satisfies J{B, C) corresponds to the exact conditional con- 
straint tree {B{B,C),KB'{B,C)), where KB'{B,C) contains the pair {H\G)[xh/xg,xh/xg] 
and {G\H)[vi,vi] for each pair {H\G)[ui,U2], {G\H)[vi,V2] E KB{B,C) with G -> iJ. 

We will show that the least upper bound of Pr{BL{C)) / Pr{B), Pr(BL{C))/ Pr{B), 
and Pr{L{C))/Pr{B) subject to Pr \= KB'{B,C) and Pr{B) > is given by r{B,C), 
I^{B,C), and n{B,C), respectively. It will then follow that the least upper bound of 
Pr{BL{C))/Pr{B), Pr(BL{C))/ Pr{B), and Pr{L{C)) / Pr{B) subject to Pr \= KB{B,C) 
and Pr{B) > is given by the maximum of I'^{B, C), I^{B, G), and I^{B, G), respectively, 
subject to all xc (G G B{B, G)) satisfying J{B, G). 

That is, we implicitly performed the variable transformation described in the two ex- 
amples. This transformation is indeed correct for conditional constraint trees: 

Lemma 4.4 

a) IfxQ (G G B{B,C)) satisfies J{B,C), then for all conditional constraints (if|G)[ui, U2] G 
KB{B, C) such that G ^ H, there exists uh G [iti, it2] with xh = uh ■ xc ■ 

b) Let Uh G [iti,it2] for all {H\G)[ui,U2\ G KB{B^C) such that G ^ H. There exists xq 
(G G B{B^ G)) with J{B, C) and xh = uh ■ xc for all nodes H with parent G. 

Proof, a) For all nodes H with parent G, let uh be defined by un = xu / xq- 

b) Let xb = 1, and for all nodes H with parent G, let Xfj be defined by xh = uh ■ xq- n 

We are now ready to formulate an optimization problem for computing the requested 
least upper bound. 



Theorem 4.5 Let X2 he the maximum of x subject to x < I°'{E,E'^) and J{E,E'^). 

a) Pr{EL{E^)) < X2 ■ Pr{E) for all Pr E Mo{E,E^). 

b) There exists Pr G MoiE.E"^) with Pr{E) > and Pr{EL{E^)) = X2 ■ Pr{E). 
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Proof. Let Pr{B\C) = vi for all {B\C)[vi,V2] G KB such that B ^ C. By Theorem 4.2, 
the requested least upper bound is the maximum of X subject to a; < H^{E,E^) and 
Pr{C\B) = uc ^ [ui,U2] for all {C\B)[ui,U2] e KB such that B ^ C. By Lemma 4.4, we 
can equivalently maximize x subject to x < I"{E,E'^) and J{E,E^). □ 

We now wonder how to solve the generated optimization problem, since I'^{E, E^) may 
still contain min-operations that cannot be tackled by linear programming. Moreover, given 
a method for solving this optimization problem, we are also interested in a rough idea on 
the overall time complexity of computing the requested least upper bound this way. Finally, 
we are interested in possible improvements to increase efficiency. These topics are discussed 
in the rest of this section. 

If E"^) does not contain any min-operations at all, then the generated optimization 
problem is already a linear program. Otherwise, it can easily be transformed into a linear 
program. In a first transformation step, all inner min-operations are eliminated. This can 
easily be done due to the well-structuredness of I°'{E^E'^). In a second step, the only 
remaining outer min-operation is eliminated by introducing exactly one linear inequality 
for each contained operand. In these linear inequalities, the operands of the outer min- 
operation are upper bounds of x. 

To get a rough idea on the time complexity of computing the requested least upper 
bound this way, we must analyze the size of the generated linear programs. It is given 
by the number of variables, the number of linear inequalities in J{E,E'^), and the num- 
ber of linear inequalities extracted from x < I°'{E,E^). The latter is quite worrying, 
since n{B,C) in Fusion seems to produce many min-operands. Moreover, n{B,C) in 
Fusion contains I"{B,Bi), and I"{B,C) in Chaining contains P{C,C^). So, due to 
this crossed dependency, the overall number of generated linear inequalities is likely to 
'explode' for trees that branch very often. 

To avoid these problems, we introduce the auxiliary functions J", J^, and over the 
variables xb {B e B). Let J" (5, C) = 4, J^{B, C) = and JT(S, C) = 7^, where a^, 
and 72 in LEAF {C = B), Chaining (C = Bi), and Fusion {C = B1B2 . . . B^ with A; > 1) 
are given as follows: 



Leaf: 



"2 = 
P2 = 



72 = Xb 
Chaining: 



"2 = TOam{xB, Xc + Pr\(B\C) ^ XB-XC+ Pr\[B\C) ) 

g, _ l-Prr{B\C) jHC,C^) 

1^2 - Pri{B\C) -f- Pri{B\C) 

, _ J''{C.C^) 

^2 - Pri(B|C) 
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Fusion: 

a'2 = mill J"{B,Bi) 

«e[l:/c] 

= min J/5(S,S,) 
io = min( min JT(S,Si), min (J''(B,Bi) + J^(B,BA)) 

Note that a'2 in Chaining can be separated into the cases = C and 7^ C. Since 
simply a2 = xc for = C, we reduce the number of generated Hnear inequalities this way. 

The next lemma shows that the functions and P can be expressed in terms of 

the auxiliary functions J", J^, and J"* . 

Lemma 4.6 For all xb {B G B) that satisfy J{E,E'^): 

Q!2 = min(Q;^,7^), P2 = min(/3^,7^), and 72 = 72 • 

Proof sketch. The claim can be proved by induction on the recursive definition of the 
functions I^, and H. □ 

Briefiy, by Theorem 4.3, Theorem 4.5, and Lemma 4.6, the tight answer to the premise- 
restricted complete query B{F\E)[xi,X2] is given by {xi/ Hf{E, E'^), X2/X2}, where X2 is 
the maximum of subject to < J"(E,E'^), x < J^{E,E^), and J{E,E^). 

In our example, we get {a;i/0.00, a;2/0.27} as the tight answer to the premise-restricted 
complete query 3(QRSTU|M)[2;i, 2:2] to the conditional constraint tree in Fig. 2, right side. 

The time complexity of computing the requested greatest lower bound and especially 
the requested least upper bound this way is analyzed in Section 4.5. 

4.2 Strongly Conclusion-Restricted Complete Queries 

We now focus on computing the tight answer to strongly conclusion-restricted complete 
queries to general conditional constraint trees. In the sequel, let {B,KB) be a conditional 
constraint tree and let 3{F\E)[xi,X2\ be a strongly conclusion- restricted complete query. 

The tight upper answer to 3{F\E)[xi,X2\ is always given by {a;2/l}. To compute the 
tight lower answer to a:;2], we first compute the tight lower answer {yi/ui} to the 

premise-restricted complete query 3{E\F)[yi,y2\. We then distinguish the following cases: 

If iti > 0, then the tight lower answer to 3{F\E)[xi^X2\ is computed locally by a function 
Hi (like the tight lower answer to premise- restricted complete queries in Section 4.1.2). 
li u\ = and E ^ F, then the tight lower answer to 3{F\E)[xi, X2] is given by {xi/1}. 
Otherwise, the tight lower answer to zi{F\E)[xi,X2] is given by {xi/0}. 

We now focus on the technical details. Let {B, be the directed graph that belongs 
to the premise-restricted complete query 3{E\F)[yi,y2] (see Section 4.1.1). Let Pri{B\C) 
denote vi for all {B\C)[vi,V2] G KB. In the sequel, let B be a node and let C = B^. Again, 
the cases C = B, C = Bi with a node Bi 7^ B, and C = B1B2 ■ ■ ■ B^ with k > 1 nodes 
Si, S2, . . . , -Bfc refer to Leaf, Chaining, and Fusion, respectively. 

We define the function Hf for computing greatest lower bounds in the case Hi{B, C) > 
as follows. Let Hf{B,C) = Si (note that 5i will coincide with the greatest lower bound 
of Pr{BL{C)) / Pr{L{C)) subject to Pr \= KB and Pr{L{C)) > 0), where in Leaf 
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(C = B), Chaining (C = Bi), and Fusion (C = BjBs ■■■Bk with A; > 1) is given as 
follows (note that iJf (C, C^) and H^{B,Bi) are defined like in Section 4.1.2): 

Leaf: 

6, = 1 

Chaining: 

s, = g/(fict).(i + ^;ffjg-^ ) 

Fusion: 

min H^(B,Biy(l/H{(B,Bi)-l) 

Si = 1 + '^ 

l-k+Y^H^iB,Bi) 
\ t=i / 

By induction on the definition of Hf, it is easy to see that Hi{B, C) > entails that 
is defined and that 5i > (note that Hf^{B,C) = «i in Leaf, Chaining, and Fusion is 
defined like in Section 4.1.2). In this case, is sound and globally complete with respect 
to B and C iff Hf{B, C) = Si is the greatest lower bound of Pr{BL{C)) / Pr{L{C)) subject 
to Pr G Mo{B,C) and Pr{L{C)) > 0. Thus, the next theorem shows soundness and global 
completeness of Hf. It also shows that, for C = Bj B2 ■ ■ ■ B^ with k > 1, the least upper 
boundof Pr(5i:(C))/Pr(L(C)) subject to Pr G Mo{B,C) and Pr{L{C)) > is given by 1. 

Theorem 4.7 

a) Ifai > 0, then for all Pr G Mo{B,C), it holds 61 ■ Pr{L{C)) < Pr{BL{C)). 

h) If ai > 0, then there is a probabilistic interpretation Pr G Mo{B,C) with Pr{B) > 0, 
Pr{L{C)) > 0, 5i ■ Pr{L{C)) = Pr{BL{C)), and ai ■ Pr{B) = Pr{BL{C)). 

c) If ai > and C = B1B2 ■ ■ ■ Bf^ with k > 1, then there is some Pr E Mo{B,C) with 
Pr{B) > 0, Pr{L{C)) > 0, I ■ Pr{L{C)) = Pr{BL{C)), and ai ■ Pr{B) = Pr{BL{C)). 

d) If ai = and C = Bj B2 ■ ■ ■ B^ with k > 1, then for each e > there is some Pr G Mo{B, C) 
with Pr{B) > 0, Pr{L{C)) > 0, 1 • Pr{L{C)) = Pr{BL{C)), and e ■ Pr{B) > Pr{BL{C)). 

Proof. The proof is given in full detail in Appendix C. □ 

We are now ready to give the following characterization of tight answers to strongly 
conclusion-restricted complete queries to conditional constraint trees. 

Theorem 4.8 Let {B,KB) be a conditional constraint tree and let B{F\E)[xi,X2] be a 
strongly conclusion-restricted complete query. Let the tight lower answer to the premise- 
restricted complete query B{E\F)[yi,y2] be given by {yi/ui}. 

(1) If ui > 0, then the tight answer to 3{F\E)[xi,X2] is given by {xi/ Hf{F, F^^), X2/I}. 

(2) Ifui = and E ^ F, then the tight answer to 3{F\E)[xi, X2] is given by {xi/1, X2/l}- 

(3) Otherwise, the tight answer to B{F\E)[xi,X2] is given by {xi/0, 2:2/1}. 
Proof. The proof is given in full detail in Appendix C. □ 
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4.3 Complete Queries 

We now show that the problem of computing tight answers to complete queries can be 
reduced to the problems of computing tight answers to premise-restricted complete queries 
and of computing tight answers to strongly conclusion-restricted complete queries. 

In detail, a complete query is premise-restricted, it is strongly conclusion-restricted, 
or it can be reduced to premise-restricted complete queries and to strongly conclusion- 
restricted complete queries. For example, given the complete query 3(STU|MQR)[a;i, 2:2] 
to the conditional constraint tree in Fig. 2, right side, we first compute the tight answer 
{yi/ui, 2/2/^^2} to the premise- restricted complete query 3(MQR|0)[yi, ^2] (directed to the 
corresponding subtree) and the tight answer {zi/vi, Z2/V2} to the strongly conclusion-re- 
stricted complete query 3(0|MQR)[2;i, 2:2] (directed to the corresponding subtree). We then 
generate a new conditional constraint tree by replacing the subtree over the nodes M, N, 
0, Q, and R by the pair of conditional constraints (B|0)[iti, 1*2] and (0|B)[t)i, t)2] over the 
nodes B and (note that B represents MQR). Finally, we compute the tight answer to the 
premise-restricted complete query 3(STU|B)[2;i, 2:2] to the new conditional constraint tree. 

Note that this reduction can always be done, since for each query 3{F\E)[xi,X2], all 
paths from a basic event in i? to a basic event in F have at least one basic event in common. 

Theorem 4.9 Let (B, KB) be a conditional constraint tree and let 3{F\E)[xi, X2] be a com- 
plete query that is not premise-restricted and not strongly conclusion-restricted. 

a) There exists a basic event G E B and two conditional constraint trees {Bi,KBi) and 
{B2, KB2) such that 61(162 = {G}, BiU 62 = 6, and B{G\E)[zi, Z2] is a strongly conclu- 
sion-restricted complete query to {61, KBi). 

b) Let the tight answer to the premise-restricted complete query 3(i?|G)[yi, 1/2] to {61, KBi) 
be given by {yi/ui, y2/u2} and let the tight answer to the strongly conclusion-restricted 
complete query 3{G\E)[zi, Z2] to {Bi,KBi) be given by {zi/vi, Z2IV2}. 

(1) If ui > 0, then also vi > and the tight answer to the complete query B{F\E)[xi,X2] 
to {B,KB) coincides with the tight answer to the premise-restricted complete query 
3{F\B)[xi,X2] to {62 U {B},KB2 U {{B\G)[ui,U2], {G\B)[vi,V2]}), where B is a new 
basic event with B ^ 62- In particular, for exact conditional constraint trees (6, KB), 
the tight answer to the complete query B{F\E)[xi,X2] is given by: 

{^1/ max(0, .1 - ^ + i^), X2/ min(l, 1 - «i + ^, j^^)} , 

where si = H^{G,G^), S2 = H^{G,G^), andt2 = H^{G,G^) (note that H^, H^, and 
II2 are defined like in Section ^.1.1). 

(2) //iti =0, wi = 1, andG^F, then the tight answer to the complete query B{F\E)[xi,X2] 
to {B,KB) is given by {xi/1, X2/I}. 

(3) Otherwise, the tight answer to the complete query 3{F\E)[xi, X2] to (6, KB) is given 
by {xi/0, X2/I}. 

Proof. The proof is given in full detail in Appendix D. □ 
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4.4 Queries 

The problem of computing tight answers to queries can be reduced to the more specialized 
problem of calculating tight answers to complete queries. 

More precisely, given a query 3{F\E)[xi,X2] to a conditional constraint tree {B,KB), 
a complete query 3{F'\E')[xi, X2] to a conditional constraint tree {B', KB') is generated by: 

1. While {B, KB) contains a leaf B that is not contained in EF: remove B from B and 
remove the corresponding pair {C\B)[ul,U2]^ {B\C)[vi,V2] G KB from KB. 

2. While EF contains a basic event B that is not a leaf in {B, KB): increase B hj a. new 
basic event B', increase KB by the pair {B'\B)[1, 1] and {B\B')[1, 1], and replace each 
occurrence of B in 3{F\E)[xi,X2] by the new basic event B'. 

It remains to show that the generated probabilistic deduction problem has the same 
solution as the original probabilistic deduction problem: 

Theorem 4.10 The tight answer to the query 3{F\E)[xi,X2] to {B,KB) coincides with the 
tight answer to the complete query 3{F'\E')[xi,X2] to {B',KB'). 

Proof. Let {B'\ KB") be the conditional constraint tree that is generated in step 1 and let 
{F\E)[ui,U2] be a tight logical consequence of KB". We now show that {F\E)[ui,U2] is also 
a tight logical consequence of KB. First, {F\E)[ui,U2] is a logical consequence of KB, since 
KB" is a subset of KB. Moreover, each model Pr" of KB" (that is defined on all atomic 
events over B") can be extended to a model Pr of KB (that is defined on all atomic events 
over B) with Pr{A) = s ■ Pr"{A) for all atomic events A over B" that are different from the 
conjunction of all negated basic events in B", where s is a real number from (0, 1]. This 
model can be constructed inductively like in the proof of Theorem 3.2. Thus, for u G [ni, it2], 
Pr"{E) >Oandu- Pr"{E) = Pr"{EF) entails Pr{E) > and • Pr{E) = Pr{EF). 

Finally, {F\E)[ui.,U2\ is a tight logical consequence of KB" iff {F'\E')[ui.,U2\ is a tight 
logical consequence of KB' ^ since we just introduce synonyms for basic events in step 2. □ 

4.5 Computational Complexity 

4.5.1 Exact Conditional Constraint Trees 

We now show that for exact conditional constraint trees, our technique to compute the tight 
answer to queries runs in linear time in the number of nodes of the tree. In the sequel, let 
{B, KB) be an exact conditional constraint tree and let n denote its number of nodes. 

Lemma 4.11 The tight answer to a premise-restricted or strongly conclusion-restricted 
complete query can he computed in linear time in n. 

Proof. For exact conditional constraint trees, our approach to compute the tight upper 
answer to premise-restricted complete queries by H2, H2 , and H2 runs in time 0{n): 

The directed tree can be computed in time 0{n). An initialization of a leaf with a 
constant number of assignments is performed exactly for each leaf of the directed tree, a 
chaining with a constant number of arithmetic operations is performed exactly for each 
arrow of the directed tree. Hence, initializing all leaves and performing all chainings runs 
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in time 0{n). A fusion is done for each branching of the directed tree, using linear time in 
the number of branches. Thus, all fusions together run in time 0{n). 

Even for general conditional constraint trees, the tight lower answer to premise-restricted 
complete queries, and hence also the tight answer to strongly conclusion-restricted complete 
queries, is analogously computed in time 0{n). □ 

Theorem 4.12 The tight answer to a query can be computed in linear time in n. 

Proof. We assume that the set of basic events B is totally ordered and that the basic events 
in the conjunctive events E and F of the query 2:2] are written in this order. 

First, the query is reduced to a complete query according to Section 4.4. This reduction 
can be done in time 0{n). Now, if the generated complete query is premise- restricted or 
strongly conclusion-restricted, then the claim follows immediately from Lemma 4.11. 

Otherwise, the generated complete query is reduced to premise-restricted and strongly 
conclusion-restricted complete queries according to Section 4.3. Also this reduction can be 
done in time 0(n), since the basic event G in Theorem 4.9 a) is computable in time 0{n). 
Hence, the claim follows from Theorem 4.9 and Lemma 4.11. Note that t2 = H2{G, G^) in 
Theorem 4.9 b) (1) can also be computed in time 0{n). □ 

4.5.2 Conditional Constraint Trees 

For general conditional constraint trees, our technique to compute the tight lower answer 
to queries runs still in linear time, while our technique to compute the tight upper answer 
to queries runs in polynomial time in the number of nodes of the tree. In the sequel, let 
{B, KB) be a general conditional constraint tree and let n denote its number of nodes. 

Lemma 4.13 

a) The tight lower answer to a premise-restricted complete query and the tight answer to a 
strongly conclusion-restricted complete query can be computed in linear time in n. 

b) The tight upper answer to a premise-restricted complete query can be computed in poly- 
nomial time in n. 

Proof, a) The claim is already shown in the proof of Lemma 4.11. 

b) Our linear programming technique to compute the tight upper answer to premise- 
restricted complete queries runs in polynomial time in n: 

Linear programming runs in polynomial time in the size of the linear programs (Pa- 
padimitriou & Steiglitz, 1982; Schrijver, 1986), where the size of a linear program is given 
by its number of variables and its number of linear inequalities. 

We now show that the size of our linear programs in Section 4.1.2 is polynomial in n. 
The number of variables is n + 1. The number of linear inequalities in J{E,E^) is 2n. 
By induction on the recursive definition of J", , and J"' , it can be shown that the 
number of min-operands in J°'{B,C), J'^{B,C), and r{B,C) is limited by \B{B,C)\'^, 
\B{B,C)\, and \B{B,C)\'^, respectively. Hence, the number of linear inequalities extracted 
from X < J"{E, E^) and x < J^{E, E^) is limited by \B{E, Et)|2 + \B{E, E^)\^ = + 
Thus, the overall number of generated linear inequalities / is limited by 1^ = 2n + ri^ + . 
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Finally, note that 1^ is a very rough upper bound for I, in many conditional constraint 
trees (especially in those that branch very rarely), / is much lower than l^. For example, 
taking a complete binary tree with n = 127 nodes, we get only / = 19 964 compared to 
lu = 260161024. In the example of Section 4.1.2 with n = 9 nodes, we get only I = 72 
compared to = 6 660. Another example is a tree that is degenerated to a chain of basic 
events. In this case, we even get / = 5n + 1, that is, the overall number of generated linear 
inequalities is linear in n. □ 

Theorem 4.14 

a) The tight lower answer to a query can be computed in linear time in n. 

b) The tight upper answer to a query can be computed in polynomial time in n. 

Proof. We assume that the set of basic events B is totally ordered and that the basic events 
in the conjunctive events E and F of the query 3{F\E)[xi, X2] are written in this order. 

Like in the proof of Theorem 4.12, the query is reduced to a complete query according 
to Section 4.4. This reduction can be done in time 0{n). Now, if the generated com- 
plete query is premise-restricted or strongly conclusion-restricted, then the claims follow 
immediately from Lemma 4.13. 

Otherwise, the generated complete query is reduced to premise-restricted and strongly 
conclusion-restricted complete queries according to Section 4.3. Again, this reduction can 
be done in time 0(n), since the basic event G in Theorem 4.9 a) is computable in time 0{n). 
Thus, the claims follow from Theorem 4.9 and Lemma 4.13. Note that in Theorem 4.9 b) (1), 
the tight lower answer to B{F\B)[xi,X2] can be computed without U2 and V2- □ 

4.6 Comparison with the Classical Linear Programming Approach 

As a comparison, we now briefly describe how probabilistic deduction in conditional con- 
straint trees can be done by the classical linear programming approach (Paafi, 1988; van 
der Gaag, 1991; Amarger et al. 1991; Hansen et al. 1995). In the sequel, let 3{F\E)[xi,X2] 
be a query to an exact or general conditional constraint tree {B, KB) over n nodes. 

The tight answer to 3(F|i?)[.2;i, 2:2] can be computed by solving two linear programs. In 
detail, the requested greatest lower and least upper bound are given by the optimal values 
of the following two linear programs with xa >0 {A ^ Ab) and opt G {min, max}: 

opt 12AeAB,A^EFXA subject to 
T,A£Ab,A^E^A = 1 

T.AeAB,A^GHXA > Ul ■ EagAb,a^gXa for all {H\G)[uuU2] G KB 
T,agAb,a^ghXa < U2 ■ J2agAb,a^gXa for all {H\G)[uuU2] G KB 

That is, the tight answer is computed by solving two linear programs with 2" variables 
and 4n — 2 linear inequalities. For example, the tight answer to the premise-restricted 
complete query 3(QRSTU|M)[a;i, 0:2] to the conditional constraint trees in Fig. 2 yields two 
linear programs with 2^ = 512 variables and 4 • 9 — 2 = 34 linear inequalities. 
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Hence, if we now solve these two linear programs by the standard simplex method or 
the standard interior-point technique, then we need immediately exponential time in n. It 
is still an open question whether column generation techniques can help to solve the two 
linear programs in less than exponential time in n in the worst case. 

5. Comparison with Bayesian Networks 

In this section, we briefly discuss the relationship between conditional constraint trees and 
Bayesian networks (Pearl, 1988). 

A Bayesian network is deflned by a directed acyclic graph G over discrete random vari- 
ables Xi, X2, . . . , Xn as nodes and by a conditional probability distribution Pr(Xj|pa(Xj)) 
for each random variable Xi and each instantiation pa(Xj) of its parents pa(Xj). It specifies 
a unique joint probability distribution Pr over Xi, X2, . . . , X„ by: 

n 

Pr(Xi,X2,...,X„) = l[Pr{Xi\MXi)). 

That is, the joint probability distribution Pr is uniquely determined by the conditional 
distributions Pr{Xi\pa.{Xi)) and certain conditional independencies encoded in G. 

Hence, Bayesian trees (that is, Bayesian networks that have a directed tree as associated 
directed acyclic graph) with only binary random variables seem to be very close to exact 
conditional constraint trees. However, exact and general conditional constraint trees are 
associated with an undirected tree that does not encode any independencies! For this rea- 
son, exact and general conditional constraint trees describe convex sets of joint probability 
distributions rather than single joint probability distributions. 

But, would it be possible to additionally assume certain independencies? Of course, with 
each exact or general conditional constraint tree {B, KB), we can associate all probabilistic 
interpretations Pr that are models of KB and that have additionally the undirected tree 
{B,-^) as an I-map (Pearl, 1988). That is, we would have independencies without causal 
directionality like in Markov trees (Pearl, 1988). However, this idea does not carry us 
to a single probabilistic interpretation (neither for exact conditional constraint trees, nor 
for general conditional constraint trees), and it is an interesting topic of future research to 
investigate how the computation of tight answers in exact and general conditional constraint 
trees changes under this kind of independencies (which yield tighter bounds, since they 
reduce the number of models of exact and general conditional constraint trees). 

Finally, if we additionally flx the probability of exactly one node, then an exact condi- 
tional constraint tree under the described independencies specifies exactly one probabilistic 
interpretation (note that, to keep satisfiability, the probability of a node must respect certain 
upper bounds, which are entailed by the exact conditional constraint tree). But, such exact 
conditional constraint trees are in fact Bayesian trees with only binary random variables. 

6. Summary and Conclusions 

We showed that globally complete probabilistic deduction with conditional constraints over 
basic events is NP-hard. We then concentrated on the special case of probabilistic deduction 
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in exact and general conditional constraint trees. We presented very efficient techniques for 
globally complete probabilistic deduction. More precisely, for exact conditional constraint 
trees, we presented a local approach that runs in linear time in the size of the conditional 
constraint trees. For general conditional constraint trees, we introduced a global approach 
that runs in polynomial time in the size of the conditional constraint trees. 

Probabilistic deduction in conditional constraint trees is motivated by previous work 
in the literature on inference rules. It generalizes patterns of commonsense reasoning that 
have been thoroughly studied in this work. Hence, we presented a new class of tractable 
probabilistic deduction problems, which are driven by artificial intelligence applications. 

It is also important to note that the deduction process in exact and general conditional 
constraint trees can easily be elucidated in a graphical way. For example, the computation 
of the tight answer to the premise-restricted complete query 3(QRSTU|M)[2;i, 2:2] to the 
exact conditional constraint tree in Fig. 2, left side, can be illustrated by labeling each node 
of the directed tree in Fig. 3 with the corresponding tightest bounds of Table 1. 

Like Bayesian networks, conditional constraint trees are well-structured probabilistic 
knowledge bases that have an intuitive graphical representation. Differently from Bayesian 
networks, conditional constraint trees do not encode any probabilistic independencies. Thus, 
they can also be understood as a complement to Bayesian networks, useful for restricted 
applications in which well-structured independencies do not hold or are difficult to access. 

Conditional constraint trees are quite restricted in their expressive power. However, in 
more general probabilistic knowledge bases, probabilistic deduction in conditional contraint 

trees may always act as local inference rules. For example, in case we desire explanatory 
information on some specific local deductions from a subset of the whole knowledge base 
(which could especially be useful in the design phase of a probabilistic knowledge base). 

An important conclusion of this paper concerns the question whether to perform prob- 
abilistic deduction by the iterative application of inference rules or by linear programming. 
The techniques of this paper have been elaborated by following the idea of inference rules in 
probabilistic deduction. Hence, on the one hand, this paper shows that the idea of inference 
rules can indeed bring us to efficient techniques for globally complete probabilistic deduction 
in restricted settings. However, on the other hand, given the technical complexity of the 
corresponding proofs, it seems unlikely that these results can be extended to probabilistic 
knowledge bases that are significantly more general than conditional constraint trees. 

That is, as far as significantly more general probabilistic deduction problems with con- 
ditional constraints are concerned, the iterative application of inference rules does not seem 
to be very promising for globally complete probabilistic deduction. Note that a similar 
conclusion is drawn in a companion paper (1998a, 1999a), which shows the limits of locally 
complete inference rules for probabilistic deduction under taxonomic knowledge. 

For example, probabilistic deduction from probabilistic logic programs that do not as- 
sume probabilistic independencies (Ng & Subrahmanian 1993, 1994; Lukasiewicz, 1998d) 
should better not be done by the iterative application of inference rules. Note that much 
more promising techniques are, for example, global techniques by linear programming 
(Lukasiewicz, 1998d) and in particular approximation techniques based on truth-functional 
many- valued logics (Lukasiewicz 1998b, 1999b). 
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Appendix A. Preliminaries of the Proofs for Sections 4.1 to 4.3 

In this section, we make some technical preparations for the proofs of Theorems 4.1, 4.2, 
4.7, 4.8, and 4.9. In the sequel, we use the notation 





Xl,2 








7-2 


Cl 


C2 





as an abbreviation of the following system of equations: 

^^^^ 2;i,i+a;i,2 = n xx,\^X2,\ = ci 

2:2,1 + 2:2,2 = r2 xi^2 + 2:2,2 = C2 . 

The next lemma provides the optimal values of two linear programs to be solved in the 
proofs of Theorems 4.1, 4.2, 4.7, 4.8, and 4.9. 

Lemma A.l Let ri, r2, ci, C2 > with ri + r2 = ci + C2. For all i, j G {1, 2}: 

a) min(ri,Cj) = max Xij subject to (11) and Xn^m > for all n,m ^ {Ij^}. 

b) max(0, — C3_j) = min Xij subject to (11) and Xn^m > for all n,m ^ {Ij^}. 
Proof. The claims can easily be verified (Lukasiewicz, 1996). □ 

Let us assume that a conditional constraint tree is the union of two subtrees that have 
just one node in common. A model of each subtree and a third model related to the 
common node can now be combined to a model of the whole conditional constraint tree. 
This important result follows from the next lemma. 

Lemma A. 2 Let Bi and B2 be sets of basic events with BinB2 = 0. Let Bo be a new basic 
event that is not contained in BiU B2- Let Pri and Pr2 be probabilistic interpretations on 
the atomic events over BiU {Bq} and B2^ {Bq}, respectively. Let Bi and B2 be conjunctive 
events over Bi and B2, respectively. Let Pro be a probabilistic interpretation on the atomic 
events over {i?o, -61,^2} with Pro{HoHi) = Pri{HoHi) and ProiHoHs) = Pr2{HoH2) for 
all atomic events Ho, Hi, and H2 over {^0}, {-Bi}, and {B2}, respectively. 

There is a probabilistic interpretation Pr on the atomic events over Bi U B2U {Bq} with: 

^^2^ Pr{HoHiH2) = Pro{HoHiH2), 

Pr{HoA,) = Pri{HoAi), and PriHgAs) = Pr2{HoA2) 
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for all atomic events Ho, Hi, H2, Ai, and A2 over the sets of basic events {Bo}, {Bi}, 
{B2}, Bi, and B2, respectively. 

Proof. Let the probabilistic interpretation Pr on the atomic events over BiU B2U {Bo} 
be defined as follows: 



Pr{HoAiA2) 



■pro{HoH,H,) . . if PriiHoH,) ■ Pr2{HoH,) > 

if PriiHoHi)- Pr2{HoH2) = 



for all atomic events Ho. Ai, and A2 over {.Bo}, Bi, and B2, respectively, with atomic events 
Hi over {^i} and H2 over {B2} such that Ai ^ Hi and A2 ^ H2. 

Now, we must show that Pr satisfies (12). Let Ho, Hi, and H2 be atomic events over 
{^o}, {^i}, and {^2}, respectively. For Pri{HoHi) > and Pr2{HoH2) > 0, we get: 

Pr{HoH,H,) = E Pro{HoH,H,) ■ ■ gfiffj = Pro{HoH,H,) . 

For PniHoHi) = or Pr2{HoH2) = 0, we get PriHgHiHs) = = ProiHgHiHs). 

Let Ho, Hi, and Ai be atomic events over {Bo}, {Bi}, and Bi, respectively, with 
Ai^ Hi. For Pri{HoHi) > 0, Pr2{HoB2) > 0, and Pr2{HoB2) > 0, it holds: 

P^HoA,) = E Pro{HoH,B,).^^y^^^ 

-L Pr. ( TT TT ~R \ PrijHoAi) Pr2{HoA2) 

+ E _ Pro{HoHiB2) ■ Pr.^HoH,) ■ Pr2{HoB2) 

A2eAB2,A2^B2 

Pro(g,iJ,)- g;jgi;j = PriiHoA,). 

For Pri{HoHi) > 0, Pr2{HoB2) > 0, and Pr2{HoB2) = 0, we get: 
P^iHoA,) = E ProiHoH,B2)-g^-g^ 

Pro(gog,)- g;[g^;j = PriiHoA,). 

The proof is similar for Pri{HoHi) > 0, Pr2{HoB2) = 0, and Pr2{HoB2) > 0. 
For Pri{HoHi) = 0, we get Pr(i?o^i) = = Pri{HoAi). 

Finally, the proof of Pr{HoA2) = Pr2{HoA2) for all atomic events Ho over {^0} and 
A2 over B2 can be done analogously. □ 

Appendix B. Proofs for Section 4.1 

In this section, we give the proofs of Theorems 4.1 and 4.2. That is, we show the global 
soundness and the global completeness of the functions i?", H2, H^, and H2 ■ The proofs 
are done by induction on the recursive definition of , H^, H^ , and H^' . 
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To prove global soundness, we just have to show the local soundness of the computations 
in Leaf, Chaining, and Fusion. To prove global completeness, we construct two models 
of the conditional constraint tree, one related to the greatest lower bound and another one 
related to the least upper bound computed in Leaf, Chaining, and Fusion. 

For Leaf, such a model is trivially given. For Chaining, we combine a model of the 
arrow, a model of the subtree, and a model connected to the common node to a model of the 
extended conditional constraint tree. For Fusion, we combine models of the subtrees and 
a model connected to the common node to a model of the extended conditional constraint 
tree. More precisely, for Chaining and Fusion, the models of the subtrees are related to 
previously computed tightest bounds, while the model connected to the common node is 
related to the tightest bounds computed in the running Chaining or Fusion. 

We need the following technical preparations. The next lemma helps us to show the 
global completeness of the functions H^, , and in Chaining and Fusion. 

Lemma B. 3 a) For all real numbers u,v ^ (0; 1]; ^2 G [0; 1]; andx2,Z2 G [0, oo) with 
X2,X2 < Z2 and Z2 < X2 + X2, there is some x G [z2 — X2,X2] with: 

(13) minfl, ^,l-u+^,u-^ + ^) = minfl, ^,1 - u + + ^) . 

b) For V2,X2 G [0,1] and V2,X2,W2, Z2 G [0, CX)) with V2 < W2, V2 < W2, X2 < Z2, X2 < Z2, 

W2 <V2 + V2, and Z2 < X2 + X2, there is v & [w2 — V2, V2] and x & [z2 — X2,X2] with: 

^^^^ m.m{w2,Z2,V + Z2 - X,X + W2 - v) = mm{w2,Z2,V2 +X2,X2 + V2) 

min(?;,a;) = min(?;2,a;2) ■ 
Proof. The claims can easily be verified (Lukasiewicz, 1996). □ 

The following lemma helps us to prove the local soundness and the local completeness 
of the functions Hf, H^, H^, and in Chaining and Fusion. 

Lemma B. 4 a) Let u,v & (Oil]; x G [0,1], and x G [0, cx)). For all probabilistic inter- 
pretations Pr with Pr{B) > 0, the conditions u ■ Pr{D) = Pr{BC), v ■ Pr{C) = Pr{BC), 
X ■ Pr{C) = Pr{CL{C^)), andx-Pr{C) = Pr(CL{C^)) are equivalent to: 



PriBCLiCf)) 
Pr{B) 


Pr{BCL{C^)) 
Pr{B) 


Pr{B C) 
Pr{B) 


PriBCL{Cf)) 
Pr{B) 


Pr(BCL(Cf)) 
Pr{B) 


-.-^ 


Pr{BC L{C^)) 
Pr{B) 


Pr{B'CL{C^)) 
Pr(B) 


l-u 


Pr{BCL{Cf)) 
Pr{B) 


Pr{BCL{Cf)) 
Pr{B) 


U 


PriCLiCT-)) 
Pr{B) 


ux 

V 




u ux 

V V 


ux 

V 





b) Letv,x G [0, 1] andv,x G [0, 00). For all probabilistic interpretations Pr with Pr[B) > 0, 
the conditions v ■ Pr{B) = Pr{BL{G)), v ■ Pr{B) = Pr(BL{G)), x ■ Pr{B) = Pr{BL{H)), 
andx- Pr{B) = Pr{BL{H)) are equivalent to: 



Pr{B L(G) L(H)) 
Pr{B) 


Pr(B L(G)L(H)) 
Pr{B) 


Pr(BL(G)) 
Pr{B) 


Pr{BL(G) L(H)) 
Pr{B) 


Pr(BL(G)L(H)) 
Pr{B) 


l-v 


Pr{BL{G)L{H)) 
Pr(B) 


Pr(BL{r;)L(H)) 
Pr(B) 


V 


Pr{BL{G)L{H)) 
Pr{B) 


Pr{BL{G)L{H)) 
Pr{B) 


V 


Pr(BL(H)) 
Pr(B) 


X 




l-x 


X 
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Proof. The claims can be verified by straightforward arithmetic transformations based on 
the properties of probabilistic interpretations. □ 

After these preparations, we are now ready to prove the global soundness and the global 
completeness of the functions i/f , H2, H2 , and H2 ■ 

Proof of Theorem 4.1. The claims are proved by induction on the recursive definition 
of H{^. The case C = Bi . . . is tackled by iteratively splitting C into two conjunctive 
events. Thus, it is reduced to C = GH with conjunctive events G and H that are disjoint in 
their basic events. For C = Bi,we define u = Pr{C\B), v = Pr{B\C), and xi = Hf{C, C^). 
For C = Si . . . Bk, hence C = GH, let vi = H^{B, G) and xi = H^{B, H). 

a) All models Pr G Mo{B, C) with Pr{B) = satisfy the indicated condition. In the sequel, 
let Pr G Mo{B,C) with Pr{B) > 0. 

Basis: Let C = B. Since C = L{C), we get: 

ai ■ Pr{B) = 1 ■ Pr{B) = Pr{BC) = Pr{BL{C)) . 

Induction: Let C = Bi. For all models Pr2 G Mo{C, C^), we get by the induction hypothesis 
xi ■ Pr2{C) < Pr2{CL{C^)). Thus, Pr satisfies the same conditions. Since L(ci = L{C) 
and by Lemmata A.l and B.4 a), we then get: 

ai-PT{B) = max(0,?/ - ^ + ^) • Pr(S) < Pr{BL{C^)) = Pr{BL{C)) . 

Let C=GH. For all Pri^Mo{B,G) and Pr2^Mo{B,H), we get by the induction 
hypothesis vi ■ Pti{B) < Pti{BL{G)) and xi ■ Pr2{B) < Pr2{BL{H)). Thus, Pr satisfies 
the same conditions. Since L{G)L{H) = L{GH) = L{C) and by Lemmata A.l and B.4 b): 

max(0, - 1) ■ Pr(S) < Pr{BL{G)L{H)) = Pr{BL{C)) . 

b) 

Basis: Let C = B. A model Pr G Mo{B, C) such that Pr{B) > 0, 1 ■ Pr{B) = ai ■ Pt{B) = 
Pr{BL{C)), and Pr{BL{C)) = is given hyB,B^ 0, L 

Induction: Let C = Bi. Let the model Pri of {{C\B)[u,u],{B\C)[v,v\) with Pri(S)>0 
and Pri(C) > be defined like in the proof of Theorem 3.2. 

We now choose an appropriate model Pr2 G Mo{C,C'^). Let us first consider the 
case a;i > 0, v = l, or not L{C^)^C. By the induction hypothesis, there exists a model 
Pr2GMo(C,C^) with Pr2(C) > 0, xi ■ Pr2(C) = Pr2(CL(C^)), and Pr2{CL{C^)) = 
iff' L(C^) =^ C. Let us next assume 2:1 = 0, v <1, and -L(C^) C. By Theorem 3.2, there 
exists a model Pr2 G MoiC^C"^) with Pr'HCLiC^)) > 0. By the induction hypothesis, 
there exists a model Pr'2 G Mo(C, C^) with Pr^(C) > and ■ Pr'2(C) = Pr'2(CL(C^)). 
Hence, there exists a model Pr2 G Mo{C, C^) with Pr2{C) > and 

min(l -^;,Pr^'(CL(C^))/Pr^'(C)) • Pr2(C) = Pr2{CL{C^)) . 

By Lemma 3.1, we can choose Pri and Pr2 with Pri(C) = Pr2{C) and Pri{BC) > 
Pr2(CL{C^)). By Lemmata A.l and B.4 a), we can choose the probabilistic interpretation 
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Pro over {B,C,L{C^)} with Pro(^i) = Pri{Ai) and Pro(^2) = -Pr2(^2) for all atomic 
events Ai and A2 over {B,C} and {C, L(C^)}, respectively, such that: 

Pro{BCL{C^)) = max{0, Pr2{CL{C^)) - Pri(BC)) = 
Pro{BCL{C^)) = max{0, Pr2{CL{C^)) - Pri(BC)) . 

By Lemma A.2 with Bi = {B}, B2 = B{C, C^)\{C}, Bo = C, Bi = B, and B2 = L{C^), 
there exists a probabilistic interpretation Pr over B{B,C) with (12). Hence, it holds Pr G 
Mo{B,C) and Pr{B) > 0. By Lemma B.4 a), we get: 

ai-Pr{B) = max{0,u-^ + ^)- Pr{B) = Pr{BL{C^)) = Pr{BL{C)) . 

Moreover, it is easy to see that Pr{BL{C)) = iff L{C)^B. 

Let C = GH. By the induction hypothesis, there are models Pri G Mo{B,G) and 
Pr2 G Mo{B,H) wUh Pri{B) > 0, Pr2{B) > 0, vi ■ Pri{B) = Pri{BL{G)), xi ■ Pr2{B) = 
Pr2{BL{H)), Pri(BL{G)) = iff L(G)^S, and Pr2{BL{H)) = mL{H)^B. 

By Lemma 3.1, we can choose Pri and Pr2 with Pri{B) = Pr2{B) and Pri{B L{G)) > 
Pr2{BL{H). By Lemmata A.l and B.4 b), we can choose the probabilistic interpretation 
Pro over {B, L{G), L{H)} with Pro(^i) = Pri(^i) and Pro(^2) = ^r2(^2) for all atomic 
events Ai and A2 over {B,L{G)} and {B, L{H)}, respectively, such that: 

Pro(BL{G)L{H)) = m.in{Pr2{BL{H)), Pri{BL{G)) 
Pro{BL{G)L{H)) = max(0, Pr2(SL(i?)) - Pri(SL(G))) . 

By Lemma A.2 with Bi = B{B, G) \ {B}, B2 = B{B, H) \ {B}, Bq = B,Bi = L{G), and 
B2 = L{H), there exists a probabilistic interpretation Pr over B{B,C) with (12). Hence, 
it holds Pr G Mo{B, C) and Pr{B) > 0. By Lemma B.4 b), we get: 

max(0, - 1) ■ Pr(S) = Pr{BL{G)L{H)) = Pr{BL{C)) . 

Moreover, it is easy to see that Pr{BL{C)) = iff L{C)^B. □ 

Proof of Theorem 4.2. The claims are proved by induction on the recursive definition of 
Hr^, H^, and . A gain, the case C = Bi . . . B/. is tackled by iteratively splitting C into 
two conjunctive events. Thus, it is reduced to C = GH with conjunctive events G and H 
that are disjoint in their basic events. For C = let it = Pr(C|S), v = Pr{B\C), and 

X2 = H^iC, Ct), X2 = HiiC Ct), Z2 = H^iC, Ct) . 
For C = Bi . . . Bh, hence C = GH, we define: 

V2 = H^(B, G), V2 = Hi(B, G), W2 = H^{B, G) 
X2 = H^iB, H), X2 = Hi{B, H), Z2 = H^{B, H) . 

a) For Pr G Mo{B,C) with Pr{B) = 0, we get Pr{N) = for all TV G B{B,C). Thus, Pr 
satisfies the indicated conditions. Next, let Pr G Mo{B, C) with Pr{B) > 0. 
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Basis: Let C = B. Since L{C) 



Pr{BL{C)) 
Pr{BL{C)) 
Pr{L{C)) 



Pr{BL{C)) 
Pr{BL{C)) 
Pr{L{C)) 



= C, we get: 

: Pr{BC) 
-- Pr{DC) 
- Pr{C) 



= 1 ■ Pr{B) = 
= • Pr{B) = 
= 1 • Pr{B) = 



a2 ■ Pr{B) 
P2 ■ Pr{B) 
72 • Pr{B) . 



02 ■ Pr{B) 
P2 ■ Pr{B) 
72 • Pr{B) . 



Induction: Let C = Bi. For all models Pr2 G Mo{C, C^), we get by the induction hypothesis 
Pr2(CL(C^)) < X2 ■ Pr2(C), Pr2(CL{C^)) < X2 ■ Pr2(C), and Pr2(L(Ct)) < Z2 ■ Pr2{C). 
Hence, Pr satisfies the same conditions. Since L{C) = L(C^) and by Lemmata A.l and 
B.4 a), we then get: 

= Pr{BL{C^)) < mm{l,^,l -u + ^,u + ^) ■ Pr{B) = 
= Pr(BL{C^)) < min(f^ + ^ _ ^, Mi) . p^^^) = 

= Pr(L(C^)) < ^-PriB) = 



Let C = GH. For all models Pri e Mo{B, G) and all models Pr2 G Mo{B, H), we get 
by the induction hypothesis: 

Pri{BL{G)) < V2-Pri{B), Pr2{BL{H)) < X2 ■ Pr2{B) 
Pri(BL{G)) < V2-Pri{B), Pr2{BL{H)) < X2 ■ Pr2{B) 
Pri{L{G)) < W2-Pri{B), Pr2{L{H)) < Z2 ■ Pr2{B) . 

Thus, Pr satisfies the same conditions. Since L{C) = L{GH) = L{G)L{H) and by Lem- 
mata A.l and B.4 b), we get: 

Pr{BL{C)) = Pr{BL{G)L{H)) < mm{v2, X2) ■ Pr{B) 
Pr(BL{C)) = Pr(BL{G)L{H)) < mm{v2, X2) ■ Pr{B) 
Pr{L{C)) = Pr{L{G)L{H)) < mm{w2, Z2,V2 + X2,X2 + V2) ■ Pr{B) . 

b) and c) 

Basis: Let C = B. A model Pr G Mo{B,C) with Pr{B) > satisfying Pr{BL{C)) = 
1 ■ Pr{B) = a2 ■ Pr{B), Pr(BL{C)) = ■ Pr{B) = ^2 ■ Pr{B), and Pr{L{C)) = 1 ■ Pr{B) = 
72 • Pr{B) is given by i? H- 0, L 

Induction: Let C = Bi. Let the model Pri of {{C\B)[a,u],{B\C)[v,v\} with Pri{B) > 
and Pri(C) > be defined like in the proof of Theorem 3.2. 

For the proof of c), by the induction hypothesis, there is some Pr2eMo{C,C^) with 
Pr2(C) > 0, Pr2(CL{C^)) = X2 ■ Pr2(C), and Pr2{L{C^)) = ^2 • Pr2{C). 

By Lemma 3.1, we can choose Pri and Pr2 with Pri(C) = Pr2{C) and Pri{B C) > 
Pr2{CL{C'^)). By Lemma A.l, we can choose the probabilistic interpretation Pro over 
{B,C,L{C^)} with Pro(^i) = Pri(^i) and Pro(^2) = -P^2(-42) for all atomic events Ai 
and A2 over {S,C} and {C, L(C^)}, respectively, such that: 

PrQ{BCL{C^)) = m\n{Pri{BC),Pr2{CL{C^))) = Pr2(CL{C^)) 
Pro(BCL{C^)) = min(Pri(SC),Pr2(CL(Ct))) . 
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By Lemma A.2 with Bi = {B}, B2 = B{C,C^) \ {C}, Bq = C, Bi = B, and B2 = 
L(C^), there is a probabiHstic interpretation Pr over B{B,C) with (12). Hence, it holds 
Pr G Mo{B, C) and Pr{B) > 0. By Lemma B.4 a), we get: 

Pr(BL{C)) = Pr(BL{C^)) = min(^ + ^-u,^)- Pr{B) = ^2 ■ Pr{B) 
Pr{L{C)) = Pr{L{C^)) = ^ ■ Pr{B) = 72 ■ Pr{B) . 

For the proof of b), by the induction hypothesis, there are models -Pr 1^2 , -P'"2,2 £ 
Mo{C, C^) with Pri,2(C) > 0, Pr 2,2(0) > 0, and 

Pri,2(CL(Ct)) = X2 ■ PrX2{C), Pri^2{L{C^)) = ^2 • Pri,2{C) 

(15) 

Pr2,2{CL{C^)) = X2 ■ Pr2,2(C), Pr2,2{L{C^)) = ^2 • Pr2,2{C) ■ 

These conditions already entail X2 < Z2 and a;2 < ^;2- With the results from a), we addi- 
tionally get Z2 < X2 +X2- By Lemma B.3 a), there is a; G [z2 — X2,X2] with (13). By (15), 
there is Pr2 G Mo{C,C^) with Pr2(C) > and 

Pr2{CL{C^)) = X ■ Pr2{C), Pr2{L{C^)) = Z2 ■ Pr2{C) . 

By Lemma 3.1, we can choose Pri and Pr2 with Pri(C) = Pr2{C). By Lemma A.l, we 
can choose the probabilistic interpretation Pro over {B, C, L{C^)} with Pro(^i) = Pri{Ai) 
and Pro(^2) = Pr2(^2) for all atomic events Ai and A2 over {B,C} and {C, L(C^)}, 
respectively, such that: 

Pro{BCL{C^)) = mm{Pri{BC),Pr2(CL{C^))) 
ProiBCL{C^)) = mm{Pri{BC),Pr2iCL{C^))) . 

By Lemma A.2 with Bi = {B}, B2 = B{C,C^) \ {C}, Bq = C, Bi = B, and B2 = 
L(C^), there is a probabilistic interpretation Pr over B{B,C) with (12). Hence, it holds 
Pr G Mo{B, C) and Pr{B) > 0. By Lemma B.4 a), we get: 

Pr{BL{C)) = Pr{BL{C^)) = min(l, ^, 1 - + + ^) ■ Pr(5) = 02 ■ Pr(S) 

Pr(L(C)) = Pr(L(Ct)) = ^ . Pr{B) = 72 ■ Pr{B) . 

Let C = GH. We just show b), the claim in c) can be proved analogously. By the in- 
duction hypothesis, there are models Pr 1^1 , Pr2,i G Mo{B, G) and Pr 1^2 , Pr2,2 G Mo{B, H) 
with Pri,i(S) > 0, Pr2,i(5) > 0, Pri,2{B) > 0, Pr2,2{B) > 0, and 

Pri,i{BL{G)) = V2 ■ Pri,i{B), Pri,i{L{G)) = W2 ■ Pri,i{B) 

Pr2,i{BL{G)) = V2-Prxi{B), Pr2,i{L{G)) = W2 ■ Pr2,i{B) 

(16) 

Pri^2{BL{H)) = X2 ■ Pri,2(S), Pri,2{L{H)) = Z2 ■ PnaiB) 

Pr2,2{BL{H)) = X2 ■ Pr2,2{B). Pr2,2{L{H)) = Z2 ■ Pr2,2{B) . 

These conditions already entail V2 < W2, V2 ^ W2, X2 < Z2-, and X2 1^ Z2- With the results 
from a), we additionally get W2 < V2+V2 and Z2 < X2 + X2. By Lemma B.3 b), there is 
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V G [w2 — 1^25 '^'2] and x E [z2 — X2,X2] with (14). By (16), there is Pri G Mo{B,G) and 
Pr2 G Mo{B,H) with Pri{B) > 0, Pr2{B) > 0, and 

Pri{BL{G)) = V ■ Pri(S), Pri{L{G)) = W2 ■ Pri{B) 
Pr2{BL{H)) = X ■ Pr2{B), Pr2{L{H)) = Z2 ■ Pr2{B) . 

By Lemma 3.1, we can choose Pri and Pr2 with Pri{B) = Pr2{B). By Lemma A.l, 
we can choose the probabilistic interpretation Pro over {5, L(G), L(i?)} with PrQ{Ai) = 
Pri{Ai) and Pro(^2) = ^''2(^2) for all atomic events Ai and A2 over {B,L{G)} and 
{5,L(i7)}, respectively, such that: 

Pro(BL{G)L{H)) = mm{Pri(BL{G)), Pr2(BL{H))) 
Pro{BL{G)L{H)) = mm{Pri{BL{G)),Pr2{BL{H))) . 

By Lemma A.2 with Bi = B{B,G) \ {B}, B2 = B{B,H) \ {B}, Bq = B, Bi= L{G), 
and B2 = L{H), there is a probabihstic interpretation Pr over B{B,C) with (12). Hence, 
it holds Pr G Mo{B, C) and Pr{B) > 0. By Lemma B.4 b), we get: 

Pr{BL{C)) = Pr{BL{G)L{H)) = mm{v2, X2) ■ Pr{B) 

Pr{L{C)) = Pr{L{G)L{H)) = m.\.n{w2, Z2,V2 + X2,X2+V2) ■ Pr{B) . □ 

Finally, note that computing least upper bounds is more difficult than computing great- 
est lower bounds, since for each edge S ^ C, by Lemmata 3.1 and B.4 a), the greatest lower 
bound of Pr{BCL{C^))/Pr{B) subject to PreMo{B,C) and Pr{B)>0 is always 0, but 
the least upper bound of Pr{BCL{C^))/ Pr{B) subject to Pr G Mo{B, C) and Pr{B) > is 
generally not 1. 

Appendix C. Proofs for Section 4.2 

In this section, we give the proofs of Theorems 4.7 and 4.8. 

We need some technical preparations as follows. The next lemma helps us to show the 
local soundness of the function in Fusion. 

Lemma C.5 For all real numbers ui,u,vi,v,xi,x,yi,y G (0,1] with ui < u, vi < v, 
xi < X, yi < y, and ui + xi > 1, it holds: 

m.in{u/v — u, x/y — x) / {u + x — 1) < min(?ii/?;i — ui, Xi/yi — xi) / {ui + xi — 1) . 

Proof. The claim can easily be verified (Lukasiewicz, 1996). □ 

The following lemma helps us to show the local soundness and the local completeness 
of the function Hf in Chaining and Fusion. 

Lemma C.6 a) Let u, v, x, and y be real numbers from (0, 1]. For all probabilistic inter- 
pretations Pr with Pr{L{C^)) > 0, the conditions u-Pr{B) = Pr{BC), v-Pr{C) = Pr{BC), 
X ■ Pr{C) = Pr(CZ(C^)), and y ■ Pr{L{C^)) = Pr{CL{C^)) are equivalent to: 
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Pr{BCL(C^')) 
Pr(L(Ct)) 
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h) Let u, V, X, and y he real numbers from (0,1]. For all probabilistic interpretations Pr 
with Pr{B) > 0, the conditions u ■ Pr{B) = Pr{BL{G)), v ■ Pr{L{G)) = Pr{BL{G)), 
X ■ Pr{B) = Pr{BL{H)), and y ■ Pr{L{H)) = Pr{BL{H)) are equivalent to: 



Pr{B L{G) L{H)) 
Pr{B) 


Pr{B L{G)L{H)) 
Pr{B) 
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Proof. The claims can be verified by straightforward arithmetic transformations based on 
the properties of probabilistic interpretations. □ 

We are now ready to prove Theorems 4.7 and 4.8. 

Proof of Theorem 4.7. The claims are proved by induction on the recursive definition of 
Hi- The proof for C = B1B2 ■ ■ ■ B^ with A; > 1 is done for k = 2. It can easily be generalized 
to A; > 2. For C = Bi, we define ui = Pri{C\B), vi = Pri{B\C), xi = Hf{C,C^), and 
yi = Hf(C,C^). Note that ai > entails xi,yi > and vi + xi > 1. For C = B1B2, 
we define G = Bi, H = B2, ui = Hf'{B,G), vi = Hf{B,G), xi = Hf{B,H), and 
yi = Hf{B,H). Note that ai > entails ui,vi,xi,yi > and ui + xi > 1. 

a) All models Pr G Mo{B,C) with Pr{L{C)) = satisfy the indicated condition. In the 
sequel, let Pr G Mo{B,C) with Pr{L{C)) > and thus also Pr{B) > 0. 

Basis: Let C = B. Since C = L{C), we get: 

61 ■ Pr{L{C)) = 1 ■ Pr{L{C)) = Pr{CL{C)) = Pr{BL{C)) . 

Induction: Let C = Bi. For aU models Pr2 G Mo(C, C^), we get xi-Pr2{C) < Pr2{CL{C^)) 
by Theorem 4.3 a), and yi ■ Pr2{L{C^j) < Pr2{C L{C^j) by the induction hypothesis. Thus, 
Pr satisfies the same conditions. Since L(C^) = L{C) and by Lemmata A.l and C.6 a): 

^1 = ?/i-f7 + ^ < Pr{CL{C^)) / Pr{L{C^)) = Pr{CL{C)) / Pr{L{C)) . 

Let C = GH. For all models Pn G Mo{B,G) and Pr2 G Mo{B,H), we get by 
Theorem 4.3 a) and by the induction hypothesis, respectively: 

ui-Pri(B) < Pri(BL(G)), xi ■ Pr2{B) < Pr2(BL{H)) 
vi-Pri{L{G)) < Pri{BL{G)), yi ■ Pr2{L{H)) < Pr2{BL{H)) . 
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Hence, Pr satisfies the same conditions. Since L{G)L{H) = L{GH) = L{C) and by Lem- 
mata A.l, C.5, and C.6 b), we then get: 

. _ . I minim/vi-uuxi/yi-xi) . . -, / Pr{BL{G)L(H)) / Pr{B) . _ Pr{BL{C)) 

01 — 1/1,1-^ ui+xi-l ) - ^ I \^ ^ Pr{BL{G)L{H))IPr{B)> ~ Pr{L{C)) ' 

b) 

Basis: Let C = B. A model Pr G Mo{B,C) such that Pr{B) > 0, Pr{L{C)) > 0, 
1 • Pr(L(C7)) = Pr{BL{C)), and 1 • Pr(5) = Pr{BL{C)) is given by ^ 0, L 

Induction: Let C = Si. Let the model Pn of {(C|i?)[?xi, (5|C)[?;i, ui]} with Pri(S) >0 
and Pri{C) > be defined like in the proof of Theorem 3.2. 

By the induction hypothesis, there is Pr2 G Mo(C, C^) with Pr2(C) > 0, Pr2{L{C^)) > 0, 
yi • Pr2iL{C^)) = Pr2(CL(Ct)), and xi ■ Pr2{C) = Pr2(CL(Ct)). By Lemma 3.1, we can 
choose Pn and Pr2 such that Pri(C) = Pr2(C) and Pri(BC) > Pr2(CL{C^)). By Lem- 
mata A.l and C.6 a), we can choose the probabilistic interpretation Pro over {B, C, L{C^)} 
with Pro(^i) = Pri{Ai) and Pro(^2) = -^?"2(^2) for all atomic events Ai and A2 over 
{B,C} and {C, L(C^)}, respectively, such that: 

ProiBCL{C^)) = max{0, Pr2iCL{C^)) - Pri(BC)) = 
Pro{BCL{C^)) = max(0,Pr2(CL((7t)) -Pri(S(7)) . 

By Lemma A.2 with Bi = {B}, B2 = B{C, C^)\{C}, Bq = C,Bi = B, and B2 = L{C^), 
there exists a probabilistic interpretation Pr over B{B,C) with (12). Hence, it holds Pr G 
Mo(S,C), Pr{B) > 0, and Pr[L{C)) > 0. Moreover, by Lemma C.6 a), we get: 

Si = + ^ = Pr{CL{C^)) I Pt{L{C^)) = Pr{CL{C))/Pr{L{C)) 

ai = ^^1-^ + ^ = Pr{CL{C^)) / Pr{C) = Pr{CL{C)) / Pr{C) . 

Let C = GH. By the induction hypothesis, there are models Pri G Mo{B,G) and 
Pr2 G Mo{B,H) with Pri{B) > 0, Pr2(5) > 0, Pri(L(G)) > 0, Pr2{L{H)) > 0, and 

ui ■ Pri{B) = Pri{BL{G)), ■ Pr2{B) = Pr2{BL{H)) 

VI ■ Pri{L{G)) = Pri{BL{G)), yi ■ Pr2{L{H)) = Pr2{BL{H)) . 

By Lemma 3.1, we can choose Pri and Pr2 with Pri{B) = Pr2{B) and Pri{B L{G)) > 
Pr2{BL{H). By Lemmata A.l and C.6 b), we can choose the probabilistic interpretation 
Pro over {B, L{G), L{H)} with Pro(^i) = Pri(^i) and Pro(^2) = ^^2(^2) for all atomic 
events Ai and A2 over {B,L{G)} and {B,L{H)}, respectively, such that: 

Pro(BL{G)L{H)) = mm{Pr2(BL{H)), Pri(BL{Gj)) 
Pro{BL{G)L{H)) = max{0, Pr2{BL{H)) - Pri{BL{G))) . 

By Lemma A.2 with Bi = B{B, G) \ {B}, B2 = B{B, H) \ {B}, Bq = B, Bi = L{G), and 
B2 = L{H), there exists a probabilistic interpretation Pr over B{B,C) with (12). Hence, 



236 



Probabilistic Deduction with Conditional Constraints over Basic Events 



it holds Pr G Mo{B, C) and Pr{B) > 0. By Lemma C.6 b), we get Pr{L{C)) > and 

X _ 1 / /-I I mm(«iM-«i,xi/;/i-xi) x _ . / Pr(BL{G)L{H)) / Pr{B) ^ _ Pr{BL{C)) 

01 — ui+ii-l ~ ^ I ^ Pr{BL{G)L{H))IPr{B)> ~ Pr(L(C)) 

ry, - + _ 1 - Pt(BL(G)L(H)) _ PrjBLjC)) 

c) Let C = GH. By Theorem 4.3 b), there exist Pri G Mo{B, G) and Pr2 e Mo{B, H) with 
Pri(B) > 0, Pr2{B) > 0, Ml • Pri(S) = Pri{BL{G)), and a;i • Pr2(S) = Pr2{BL {H)). By 
Lemma 3.1, we can choose Pri and Pr2 such that Pri{B) = Pr2{B) and Pri{B L{G)) > 
Pr2{BL{H). By Lemmata A.l and C.6 b), we can choose the probabilistic interpretation 
Pro over {B, L{G), L{H)} with Pro(^i) = Pri{Ai) and Pro(^2) = ^^-2(^2) for all atomic 
events Ai and over {B,L{G)} and respectively, such that: 



Pro{BL{G)L{H)) = max{0, Pr2{BL{H) - Pri{B L{G))) = 



Pro{BL{G)L{H)) = max{0, Pr2{BL{H)) - Pri{BL{G))) . 

By Lemma A.2 with Bi = B{B,G)\{B}, B2 = B{B,H)\{B}, Bq = B,Bi = L{G), and 
B2 = L{H), there exists a probabilistic interpretation Pr over B{B,C) with (12). Hence, 
it holds Pr G Mo{B, C) and Pr(5) > 0. By Lemma C.6 b), we get Pr{L{C)) > and 

1 = Pr{BL{G)L{H)) / Pr{L{G)L{H)) = Pr{BL{C)) / Pr{L{C)) 

ai = Pr{BL{G)L{H)) / Pr{B)) = Pr{BL{C)) / Pr{B) . 

d) Let C = GH. By Theorem 3.2, there is a model Pr'" G Mo{B, C) with Pr"'{BL{C)) > 0. 

By Theorem 4.3 b), there is a model Pr" G Mo{B, C) with Pr"{B) > and • Pr"{B) = 
ai ■ Pr"{B) = Pr"{BL{C)). Hence, there is a model Pr' G Mo{B, C) with Pr'{B) > and 

minis, Pr'"{BL{C)) / Pr'"{B)) ■ Pr'{B) = Pr'{BL{C)) . 

Let the models Pri G Mo{B, G) and Pr2 G Mo{B, H) be defined by Pri(^i) = Pr'(^i) and 
Pr2(^2) = P'"'(^2) for all atomic events Ai and A2 over B{B, G) and B{B, H), respectively. 
By Lemma 3.1, we can choose Pri and Pr2 such that Pri{B) = Pr2{B) and Pri{B L{G)) > 
Pr2{BL{H). By Lemmata A.l and C.6 b), we can choose the probabilistic interpretation 
Pro over {B, L{G), L{H)} with Pro(^i) = Pri(Ai) and Pro(^2) = Pr2(^2) for all atomic 
events Ai and A2 over {B,L{G)} and {B, L{H)}, respectively, such that: 



Pro{BL{G)L{H)) = inax{0, Pr2{BL{H) - Pri{B L{G))) = 

Pro{BL{G)L{H)) = min(e, Pr'"{BL{C)) / Pr'"{B)) ■ Pro(S) . 

By Lemma A.2 with Bi = B{B,G) \ {B}, B2 = B{B,H) \ {B}, Bq = B, Bi = L{G), and 
B2 = L{H), there is a probabilistic interpretation Pr over B{B, C) with (12). Hence, it holds 
PreMo{B,C), Pr{B)>0, Pr{L{C))>0, Pr(BL{C)) = 0, and e ■ Pr{B)>Pr{BL{C)). □ 

Proof of Theorem 4.8. For ui > 0, the claim is immediate by Theorem 4.7 a) to c). 

Let ui=OandE ^ F. It holds 1-Pr(£;) = Pr{EF) for all models Pr of KB. Moreover, 
by Theorem 3.2, there exists a model Pr of KB with Pr{E) > 0. 

Let ui = and not E ^ F. By Theorem 4.3 b), there exists a model Pr of KB 
with Pr{E) >0 and Pr{EF) = 0. By Theorem 4.7 d), there exists a model Pr of KB with 
Pr{E) > and 1 • Pr{E) = Pr{EF). □ 
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Appendix D. Proofs for Section 4.3 

In this section, we give the proof of Theorem 4.9. 

The next lemma will help us to show the global tightness of the computed lower bound 
in the case (3) of Theorem 4.9 b). 

Lemma D.7 Let x G [0,1] andv,x G [0,cx)). For all probabilistic interpretations Pr with 
Pr{G) > 0, the conditions Pr{EG) = 0, v ■ Pr{G) = Pr{EG), x ■ Pr{G) = Pr{GF), and 
X ■ Pr[G) = Pr[GF) are equivalent to: 



Pr{EGF) 
Pr{G) 

Pr(EGF) 

Pr(a) 


Pr{EGF) 
Pr{G) 

Pr{E'GF) 

PriG) 


Pr{EG) 
Pr(G) 

V 


Pr{EGF) 

Pr(G) 

Pr(EGF) 
Pr{G) 


Pr{EGF) 
Pr(G) 

Pr(EGP) 
Pr{G) 


1 




Pr(GF) 
Pr{G) 


X 




l-x 


x 





Proof. The claim can be verified by straightforward arithmetic transformations based on 
the properties of probabilistic interpretations. □ 

We are now ready to prove Theorem 4.9. 

Proof of Theorem 4.9. a) By the definition of queries to conditional constraint trees, all 
paths from a basic event in _E to a basic event in F have at least one basic event in common. 
Hence, we can choose the basic event G from all such basic events in common such that 
3(G|£^)[2;i, 2^2] is a strongly conclusion-restricted complete query to a subtree. 

b) For ui > 0, the claim follows from Theorem 4.7 a) to c). For the special case of exact 
conditional constraint trees {B^KB), the claim then follows from Theorems 4.3 and 4.5. 

Let ui = 0, ui = 1, and G ^ F. It holds 1 ■ Pr{E) = Pr{EF) for all models Pr of KB. 
Moreover, by Theorem 3.2, there exists a model Pr of KB with Pr{E) > 0. 

Let ui = 0, vi = 0, and G ^ F. It is easy to see that by (1) and Theorem 4.7 d), the 
tight upper answer is given by {a;2/l}. We now show that the tight lower answer is given by 
{si/O}. By Theorem 4.3 b), there exists a model Pri oiKBi withPri(£;) > 0, Pri{G) > 0, 
and Pri{EG) = 0. By Theorem 3.2, there exists a model Pr2 of KB 2 with Pr2{G) > 0. By 
Lemma 3.1. we can choose Pri and Pr2 with Pri{G) = Pr2{G) and Pri{EG) > Pr2{GF). 
By Lemmata A.l and D.7, we can choose the probabilistic interpretation Pro over {E, G, F} 
with Pro{Ai) = Pri{Ai) and Pro(^2) = ^''2(^2) for all atomic events Ai and A2 over 
{E,G} and {G,F}, respectively, such that: 

ProiEGF) = max(0,Pr2(GF) - Pri(EG)) = 
Pro{EGF) = . 

By Lemma A. 2, there exists a probabilistic interpretation Pr over B with (12) for all atomic 
events Hq, Hi, H2, Ai, and A2 over the sets of basic events {G}, {E}, {F}, Bi \ {G}, and 
B2 \ {G}, respectively. Hence, Pr is a model of KB with Pr{E) > and Pr{EF) = 0. 
For ui = and not G ^ F, the claim follows from (1) and Theorem 4.7 d). □ 
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